✨ Available for opportunities

Faiqa Shabbir

AI & LLM Engineer · Full Stack Developer

5+ years building production AI systems: RAG pipelines, LLM agents, Computer Vision & NLP. I design things that actually work, with numbers to prove it.

About me

Hey! I'm an AI & LLM Engineer and full stack developer who obsesses over production systems, not just demos. I've spent 5+ years building things that run at scale: RAG pipelines, multi-agent workflows, computer vision systems, and MLOps infrastructure.

Currently at Turing as an AI & LLM Engineer and full stack developer, building high-fidelity computer usage datasets for GUI-based AI agents, and doing LLM research on agentic reasoning frameworks (CoT, ToT, Plan-Execute).

I care deeply about measurable outcomes: p95 latency, retrieval hit@10, cost/request. If I can't measure it, I'm not done.

Based in Pakistan 🇵🇰 · Open to remote globally

5+
Years building production AI systems
850+
Long-horizon trajectories engineered at Turing
2,500+
Evaluation scenarios for model safety
40%
Rework reduction via structured mentoring framework

Skills & Tools

💻 Programming & Languages
Python · JavaScript · TypeScript · Data Structures & Algorithms
🧠 NLP & LLMs
GPT-4 · BERT · LangChain · LlamaIndex · PyTorch · TensorFlow · Fine-tuning (SFT, RLHF)
⚙️ MLOps & Pipelines
Airflow · Docker · GCP Cloud Run · AWS
🗄️ Data & Databases
PostgreSQL · MySQL · MongoDB · Neo4j · FAISS · Redis · Vector Databases · SQL · NoSQL
📊 Machine Learning
scikit-learn · XGBoost · WEKA · pandas · NumPy
📈 Visualization & Dashboards
Matplotlib · Plotly · Streamlit · Dash · Tableau · Power BI
🌐 APIs & Web Development
FastAPI · Flask · Django · RESTful API design · GraphQL · Postman
🎨 Frontend
React.js · Next.js · Tailwind CSS
🤝 Collaboration & Practices
Git · GitHub · Agile / XP
🛠️ AI Tools
Replit · Cursor · Bolt · Windsurf · Claude Code · Aider

Experience

LLM Backend Engineer
Aug 2024 → Present
🏢 Turing · 3 concurrent tracks
Track 1 QA: Computer Usage Annotation
  • Engineered 850+ long-horizon trajectories (100–200+ actions each) to train GUI-based AI agents for complex software navigation across DataGrip, MATLAB, SPSS, Tableau & Colab
  • Built a Parent-Child alignment framework with 2,500+ evaluation scenarios covering Critical Mistakes, Side Effects & Misunderstandings, directly improving model safety & reasoning
  • Developed and enforced the S.T.R.U.C.T. QA protocol → 100% accuracy in event logging, coordinate metadata & trajectory validation across Windows, macOS & Ubuntu
  • Mentored engineers using the Where / Why / What feedback model → 40% rework reduction and +15% weekly throughput
Track 2 AI Systems: Internal Portfolio
  • Creds Creator & Proposal Producer: AI automation suite using GPT-4 + Google Slides API to generate compliant consulting proposals; includes a Validator agent for tone, formatting & quality control
  • Sales Knowledge Center: real-time data pipelines for company intelligence & news, enriched with live web search to surface executive-level insights on demand
  • Advisor Insight at Scale: RAG platform (LangChain + FAISS) enabling multi-format ingestion (PDF/DOCX) and context-aware financial Q&A; supports retrieval over large document corpora
Track 3 AGI / LLM Research
  • Fine-tuned models (SFT, RLHF) for code translation & domain reasoning, improving explainability and structured output quality on benchmark tasks
  • Created taxonomy-aligned training datasets to enhance LLM performance across classification, generation & reasoning tasks
  • Designed agentic reasoning workflows using Chain-of-Thought (CoT) and Tree-of-Thought (ToT), improving multi-step problem solving under uncertainty
  • Implemented Think-Act-Observe and Plan-Execute frameworks to simulate real-world decision making and evaluate autonomous agent reasoning in complex environments
Data Scientist
Oct 2024 → Aug 2025
🌱 Fehmida AI Startup
  • Architected RAG-based document analysis systems using LangChain + LlamaIndex + Neo4j Vector, improving retrieval and contextual understanding across financial, sustainability & research reports
  • Integrated GPT-4 with Neo4j Vector for real-time Q&A → +30% insight extraction accuracy; enabled precise financial analysis workflows at scale
  • Automated data scraping and processing with Selenium + Finnhub API, extracting structured datasets from multiple sources to support AI model training and analytics pipelines
  • Designed thematic modeling workflows using embeddings + clustering to automate document categorization, enhance search accuracy, and improve the user discovery experience
ML Engineer β€” CV & NLP
Jan 2023 → Aug 2024
🎓 University of Gujrat
  • Developed real-time web apps integrating Hugging Face vision models with a React UI, enabling interactive virtual try-on experiences for end users
  • Automated document extraction & compliance workflows using YOLOv8 + OCR (Tesseract) + NER → 95% OCR accuracy, 90% NER precision
  • Delivered construction analytics: YOLOv5 object detection + OpenCV depth estimation, improving site monitoring accuracy and automating safety reporting
  • Optimized Flask microservices architecture → 40% faster API response time and improved production reliability
  • Deployed scalable systems: Python · FastAPI · React · PostgreSQL · Docker · AWS · GitHub Actions CI/CD
Client Engagement Lead
Jan 2021 → Dec 2022
💼 Freelancer · Client Engagement
  • Designed & developed multi-tenant Learning Management Systems (LMS) and food ordering platforms with secure access controls, optimized backend logic, and scalable APIs serving multiple client organisations
  • Built UI/UX prototypes in Figma, translating client requirements into intuitive interfaces that improved adoption and engagement metrics
  • Implemented backend services in Python · FastAPI · Flask · PostgreSQL with a modular, production-grade architecture and full test coverage
  • Deployed on Docker + AWS with integrated CI/CD pipelines via GitHub Actions for zero-downtime delivery
  • Applied Agile/XP practices: rapid iteration cycles, continuous client feedback, and demo-driven delivery

Projects

01
🧠

AI Knowledge Repository

A deployed knowledge management system on GCP Cloud Run. Intelligent document ingestion, semantic search, and context-aware Q&A for organizational knowledge.

GCP Cloud Run · LangChain · Vector DB · FastAPI
02
📊

Presentation Co-Pilot

AI-powered presentation assistant. Auto-generates slide decks from natural language prompts, deployed on Railway with a polished frontend.

GPT-4 · Google Slides API · Railway · React
03
🔍

AI Stack Overflow Summarizer

Summarizes Stack Overflow threads using LLMs: extracts the best answer signals, strips noise, and serves concise dev-friendly explanations.

LLM · Python · FastAPI · SO API
04
🪖

Helmet & License Plate Detection

Real-time safety compliance system using YOLOv8 that detects helmets on construction sites and reads license plates with OCR for access control.

YOLOv8 · OpenCV · Tesseract OCR · Python
05
🏗️

Terraform Visualizer

Parses Terraform config files and generates interactive dependency graphs, helping DevOps teams understand infrastructure topology at a glance.

Terraform · Python · Graph viz · React
06
🔧

Resume-to-Job Match RAG

Semantic resume ↔ JD matching with explainability. Per-field match scoring, gap analysis, and a LangGraph improvement agent.

⏳ Building now...

FastAPI · OpenSearch · LangGraph · RAGAS

Let's connect

Open to remote roles, freelance projects, and interesting collaborations. Drop me a message; I reply fast ⚡