The Problem
Video editing is one of the most time-intensive parts of content production. A 30-minute documentary-style YouTube video can take 20–40 hours to edit: transcription, B-roll selection, subtitle sync, colour grade, audio normalisation, chapter markers.
For high-volume channels—or creators who want to focus on ideas and speaking, not post-production—this is a bottleneck. The pipeline eliminates it.
What Was Built
The pipeline takes a single voiceover audio file (and optionally a text script) and produces a fully edited YouTube video in documentary style: B-roll, synced subtitles, transitions, chapter markers, and loudness-normalised audio.
The output is production-ready. Target benchmarks: Johnny Harris, Veritasium, ColdFusion.
Dual-Stack Architecture
The system is split into two runtimes that communicate via an EDL JSON (Edit Decision List) pivot format:
Python stack — intelligence layer:
- WhisperX for word-level transcription (faster than Whisper, better timestamps)
- LLM analysis of transcript → scene segmentation, B-roll keywords, subtitle groupings
- Asset sourcing from Pexels and Pixabay with caching and deduplication
- EDL JSON v3 generation: one record per segment with timing, B-roll asset, subtitle text, transition type
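The caching and deduplication step can be sketched as a content-addressed store keyed by provider and asset ID. This is an illustrative sketch, not the pipeline's actual code; the class and key scheme are assumptions:

```python
import hashlib
from pathlib import Path

def cache_key(provider: str, asset_id: str) -> str:
    """Stable key for a sourced asset, e.g. ('pexels', '857195')."""
    return hashlib.sha256(f"{provider}:{asset_id}".encode()).hexdigest()[:16]

class AssetCache:
    """Skips re-downloading assets already fetched in this or an earlier run."""

    def __init__(self, root: Path) -> None:
        self.root = root
        self.root.mkdir(parents=True, exist_ok=True)

    def has(self, provider: str, asset_id: str) -> bool:
        return (self.root / cache_key(provider, asset_id)).exists()

    def put(self, provider: str, asset_id: str, data: bytes) -> Path:
        path = self.root / cache_key(provider, asset_id)
        path.write_bytes(data)
        return path
```

Keying on provider plus asset ID (rather than search keyword) is what makes deduplication work across segments: two scenes that resolve to the same stock clip hit the same cache entry.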
Remotion stack (React/TypeScript) — compositing layer:
- Reads the EDL JSON and renders the video frame-by-frame
- Word-synced subtitle animation (Hormozi-style: large, high-contrast, bold)
- B-roll composited over the voiceover with Ken Burns motion on stills
- @remotion/transitions for cut types (hard cut, cross-dissolve, wipe)
- Light leak overlays and motion graphics for visual rhythm
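The Ken Burns motion on stills reduces to interpolating a scale and a pan offset per frame. A minimal linear version (the real presets may use easing; the function name and parameters here are assumptions for illustration):

```python
def ken_burns(frame: int, total_frames: int,
              start_scale: float = 1.0, end_scale: float = 1.12,
              pan: tuple[float, float] = (0.0, -0.04)) -> tuple[float, float, float]:
    """Return (scale, x_offset, y_offset) for a still at the given frame.

    Offsets are fractions of the frame size; a negative y pans upward.
    """
    t = frame / max(total_frames - 1, 1)  # normalised progress in [0, 1]
    scale = start_scale + (end_scale - start_scale) * t
    return scale, pan[0] * t, pan[1] * t
```

A gentle end scale around 1.1 keeps the motion subtle; larger values read as a deliberate zoom rather than ambient movement.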
FFmpeg handles the final pass: audio ducking (B-roll music under VO), loudness normalisation to -14 LUFS, and mux.
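The normalisation-and-mux step amounts to a single ffmpeg invocation. A sketch of the command builder (file names are placeholders; the ducking step would be a separate pass using ffmpeg's sidechaincompress filter):

```python
def final_pass_cmd(video_in: str, audio_in: str, out: str,
                   target_lufs: float = -14.0) -> list[str]:
    """Build the ffmpeg command for loudness normalisation and final mux."""
    return [
        "ffmpeg", "-y",
        "-i", video_in,               # Remotion-rendered video
        "-i", audio_in,               # VO mixed with ducked B-roll music
        "-map", "0:v:0", "-map", "1:a:0",
        "-af", f"loudnorm=I={target_lufs}:TP=-1.5:LRA=11",
        "-c:v", "copy",               # video untouched; only audio is re-encoded
        "-c:a", "aac",
        out,
    ]

# subprocess.run(final_pass_cmd("render.mp4", "mix.wav", "final.mp4"), check=True)
```

Copying the video stream keeps the final pass fast: only the audio is filtered and re-encoded.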
The EDL JSON Format
The EDL JSON is the architectural decision that makes the dual-stack approach work. It is the contract between intelligence and compositing.
Each entry contains: segment_id, start_ms, end_ms, vo_text, subtitle_chunks, broll_asset, broll_type (video/image), transition_in, transition_out, motion_preset.
This format is human-readable and editable. A creator can open the EDL JSON, change a B-roll selection or subtitle grouping, and re-render without re-running the AI analysis. Render-only changes are fast; intelligence re-runs are only needed when the analysis itself needs to change.
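A single entry might look like the following. The field names are those listed above; every value is invented for illustration, and the validator is a sketch of how the contract could be enforced, not the pipeline's actual schema code:

```python
import json

EDL_FIELDS = {
    "segment_id", "start_ms", "end_ms", "vo_text", "subtitle_chunks",
    "broll_asset", "broll_type", "transition_in", "transition_out", "motion_preset",
}

def validate_segment(seg: dict) -> list[str]:
    """Return a list of contract violations (empty means the entry is valid)."""
    errors = [f"missing field: {f}" for f in sorted(EDL_FIELDS - seg.keys())]
    if seg.get("broll_type") not in ("video", "image"):
        errors.append("broll_type must be 'video' or 'image'")
    if seg.get("end_ms", 0) <= seg.get("start_ms", 0):
        errors.append("end_ms must be after start_ms")
    return errors

segment = {
    "segment_id": 12,
    "start_ms": 48_200,
    "end_ms": 53_400,
    "vo_text": "The first prototypes failed within minutes.",
    "subtitle_chunks": ["The first prototypes", "failed within minutes."],
    "broll_asset": "assets/pexels_857195.mp4",
    "broll_type": "video",
    "transition_in": "hard_cut",
    "transition_out": "cross_dissolve",
    "motion_preset": None,
}
```

Because every value is plain JSON, a hand edit to `broll_asset` or `subtitle_chunks` survives a round-trip straight into the Remotion render.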
Professional Video Rules Baked In
The LLM is prompted with specific editorial rules:
- Shot length: 3–7 seconds per B-roll clip (faster cuts feel more energetic)
- Cut ratio: 90% hard cuts, 10% transitions (over-using transitions is a beginner mistake)
- Subtitle grouping: 3–5 words per card (readability at speed)
- Ken Burns: applied to stills only, with scale and pan direction derived from image content keywords
- Audio ducking: B-roll music (if provided) ducks to -18 LUFS under the VO, then returns
These rules produce a consistent output style without requiring per-video prompt tuning.
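The 3–5 word grouping rule, for instance, can be implemented greedily with one look-ahead so the final card is never stranded with fewer than three words. A minimal sketch under that rule alone; a production chunker would likely also respect punctuation and pause boundaries:

```python
def group_subtitles(words: list[str], min_words: int = 3, max_words: int = 5) -> list[list[str]]:
    """Split a word list into subtitle cards of min_words..max_words words.

    Greedy with look-ahead: never strand a final card shorter than min_words.
    """
    cards: list[list[str]] = []
    i = 0
    while i < len(words):
        remaining = len(words) - i
        take = min(max_words, remaining)
        if 0 < remaining - take < min_words:
            take = remaining - min_words  # leave enough words for the last card
        cards.append(words[i:i + take])
        i += take
    return cards
```

For a 12-word sentence this yields cards of 5, 4, and 3 words rather than 5, 5, and an orphaned 2.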
Outputs
Each run produces:
- final.mp4 — the complete rendered video
- edl.json — the full edit decision list (inspect, edit, re-render)
- chapters.txt — YouTube chapter timestamps derived from scene segmentation
- assets/ — sourced and cached B-roll assets with metadata
Scope
- ✓ WhisperX transcription + LLM analysis → EDL JSON v3; optional text script for accuracy.
- ✓ Asset sourcing (Pexels/Pixabay) with caching, dedup, and per-segment manifests.
- ✓ Remotion: compositing, word-synced subtitles, motion graphics, @remotion/transitions, light leaks.
- ✓ FFmpeg: ducking, loudness normalisation, final mux. Outputs: final.mp4, edl.json, chapters.txt.
- ✓ Professional rules: 3–7s per shot, 90% hard cuts, Hormozi-style subtitles, Ken Burns on still B-roll.
Waqas Raza
AI-Native Full-Stack Engineer. Top Rated on Upwork · $180K+ earned · 93% job success. I build production AI agents, LLM systems, Web3 platforms, and full-stack applications.
Hire me on Upwork