Faiqa Shabbir

01 About me

Hey! I'm an AI & LLM Engineer and full stack developer who obsesses over production systems — not just demos. I've spent 5+ years building things that run at scale: RAG pipelines, multi-agent workflows, computer vision systems, and MLOps infrastructure.

Currently at Turing as an AI & LLM Engineer and full stack developer, building high-fidelity computer usage datasets for GUI-based AI agents, and doing LLM research on agentic reasoning frameworks (CoT, ToT, Plan-Execute).

I care deeply about measurable outcomes — p95 latency, retrieval hit@10, cost/request. If I can't measure it, I'm not done.

Based in Pakistan 🇵🇰 · Open to remote globally

Years building production AI systems

850+

Long-horizon trajectories engineered at Turing

2,500+

Evaluation scenarios for model safety

40%

Rework reduction via structured mentoring framework

03 Experience

LLM Backend Engineer

Aug 2024 → Present

🏢 Turing · 3 concurrent tracks

Track 1 QA — Computer Usage Annotation

Engineered 850+ long-horizon trajectories (100–200+ actions/each) to train GUI-based AI agents for complex software navigation across DataGrip, MATLAB, SPSS, Tableau & Colab
Built Parent-Child alignment framework with 2,500+ evaluation scenarios covering Critical Mistakes, Side Effects & Misunderstandings — directly improving model safety & reasoning
Developed and enforced the S.T.R.U.C.T. QA protocol → 100% accuracy in event logging, coordinate metadata & trajectory validation across Windows, macOS & Ubuntu
Mentored engineers using Where / Why / What feedback model → 40% rework reduction and +15% weekly throughput

Track 2 AI Systems — Internal Portfolio

Creds Creator & Proposal Producer — AI automation suite using GPT-4 + Google Slides API to generate compliant consulting proposals; includes a Validator agent for tone, formatting & quality control
Sales Knowledge Center — real-time data pipelines for company intelligence & news, enriched with live web search to surface executive-level insights on demand
Advisor Insight at Scale — RAG platform (LangChain + FAISS) enabling multi-format ingestion (PDF/DOCX) and context-aware financial Q&A; supports retrieval over large document corpora

Track 3 AGI / LLM Research

Fine-tuned models (SFT, RLHF) for code translation & domain reasoning — improved explainability and structured output quality on benchmark tasks
Created taxonomy-aligned training datasets to enhance LLM performance across classification, generation & reasoning tasks
Designed agentic reasoning workflows using Chain-of-Thought (CoT) and Tree-of-Thought (ToT) — improved multi-step problem solving under uncertainty
Implemented Think-Act-Observe and Plan-Execute frameworks to simulate real-world decision making and evaluate autonomous agent reasoning in complex environments

Data Scientist

Oct 2024 → Aug 2025

🌱 Fehmida AI Startup

Architected RAG-based document analysis systems using LangChain + LlamaIndex + Neo4j Vector — improved retrieval and contextual understanding across financial, sustainability & research reports
Integrated GPT-4 with Neo4j Vector for real-time Q&A → +30% insight extraction accuracy; enabled precise financial analysis workflows at scale
Automated data scraping and processing with Selenium + Finnhub API — extracted structured datasets from multiple sources to support AI model training and analytics pipelines
Designed thematic modeling workflows using embeddings + clustering — automated document categorization, enhanced search accuracy, and improved user discovery experience

ML Engineer — CV & NLP

Jan 2023 → Aug 2024

🎓 University of Gujrat

Developed real-time web apps integrating Hugging Face vision models with React UI — enabling interactive virtual try-on experiences for end users
Automated document extraction & compliance workflows using YOLOv8 + OCR (Tesseract) + NER → 95% OCR accuracy, 90% NER precision
Delivered construction analytics: YOLOv5 object detection + OpenCV depth estimation — improved site monitoring accuracy and automated safety reporting
Optimized Flask microservices architecture → 40% faster API response time and improved production reliability
Deployed scalable systems: Python · FastAPI · React · PostgreSQL · Docker · AWS · GitHub Actions CI/CD

Client Engagement Lead

Jan 2021 → Dec 2022

💼 Freelancer · Client Engagement

Designed & developed multi-tenant Learning Management Systems (LMS) and food ordering platforms — secure access controls, optimized backend logic, and scalable APIs serving multiple client organisations
Built UI/UX prototypes in Figma — translated client requirements into intuitive interfaces that improved adoption and engagement metrics
Implemented backend services in Python · FastAPI · Flask · PostgreSQL — modular, production-grade architecture with full test coverage
Deployed on Docker + AWS with integrated CI/CD pipelines via GitHub Actions for zero-downtime delivery
Applied Agile/XP practices — rapid iteration cycles, continuous client feedback, and demo-driven delivery

04 Projects

🧠

AI Knowledge Repository

A deployed knowledge management system on GCP Cloud Run. Intelligent document ingestion, semantic search, and context-aware Q&A for organizational knowledge.

GCP Cloud Run LangChain Vector DB FastAPI

🚀 Live Demo →

📊

Presentation Co-Pilot

AI-powered presentation assistant. Auto-generates slide decks from natural language prompts, deployed on Railway with a polished frontend.

GPT-4 Google Slides API Railway React

🚀 Live Demo →

🔍

AI Stack Overflow Summarizer

Summarizes Stack Overflow threads using LLMs — extracts the best answer signals, strips noise, and serves concise dev-friendly explanations.

LLM Python FastAPI SO API

⌥ GitHub →

🪖

Helmet & License Plate Detection

Real-time safety compliance system using YOLOv8 — detects helmets on construction sites and reads license plates with OCR for access control.

YOLOv8 OpenCV Tesseract OCR Python

⌥ GitHub →

🏗️

Terraform Visualizer

Parses Terraform config files and generates interactive dependency graphs — helps DevOps teams understand infrastructure topology at a glance.

Terraform Python Graph viz React

⌥ GitHub →

🔧

Resume-to-Job Match RAG

Semantic resume ↔ JD matching with explainability. Per-field match scoring, gap analysis, and LangGraph improvement agent.

⏳ Building now...

FastAPI OpenSearch LangGraph RAGAS

⌥ GitHub →

01 About me

02 Skills & Tools

03 Experience

04 Projects

AI Knowledge Repository

Presentation Co-Pilot

AI Stack Overflow Summarizer

Helmet & License Plate Detection

Terraform Visualizer

Resume-to-Job Match RAG

05 Let's connect