Skip to content

MahdiNavaei/MahdiNavaei

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

17 Commits
Β 
Β 

Repository files navigation

Mahdi Navaei

AI Engineer β€’ Agentic AI Architect β€’ Data Scientist

Profile Views LinkedIn Email Kaggle

8+ years building production-grade AI systems | Tehran, Iran

Specializing in Agentic AI, LLMs, RAG Systems, and Enterprise ML Pipelines


🎯 What I Do

I architect and deploy production-grade intelligent systems that drive real business impact. My focus areas:

Domain Expertise
Agentic AI Autonomous agents, multi-step reasoning, task orchestration, tool integration
LLMs & NLP Fine-tuning, RAG architectures, NL2SQL, conversational AI, prompt engineering
Enterprise RAG Hybrid retrieval, re-ranking pipelines, knowledge management systems
Production ML End-to-end pipelines, real-time inference, MLOps, scalable deployments

πŸš€ Flagship Project 1

πŸ€– Production-Grade Agentic AI Framework

Vision + LLM + Event Sourcing β€’ Local LLMs β€’ LangGraph β€’ HITL Safety β€’ Autonomous Task Execution

Tests Python LangGraph License Docker

ARIA is not a prompt-chain demo or a single-purpose script β€” it's a full agentic AI system built for real-world automation: observe UIs with vision, plan with LLMs, act safely with human oversight, and learn from outcomes. Designed to run on local LLMs and consumer GPUs (8GB VRAM), with native English & Persian support for privacy-sensitive and resource-constrained environments.

Cognitive architecture β€” perception, reasoning, execution, and memory are separated and observable:

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚                         ARIA β€” Cognitive Core                            β”‚
β”œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€
β”‚  πŸ‘οΈ Eye (VLM/OCR)   β†’  Observe real interfaces β€’ Screenshot β€’ UIRef    β”‚
β”‚  🧠 Brain (LLM)      β†’  Plan, execute, observe β€’ LangGraph β€’ HITL gates  β”‚
β”‚  βœ‹ Hand (Actions)   β†’  Browser β€’ Desktop β€’ Playwright β€’ PyAutoGUI       β”‚
β”‚  πŸ’Ύ Memory           β†’  Working + Episodic + Semantic (Redis β€’ Qdrant)   β”‚
β”‚  πŸ“‘ Event Bus        β†’  Kafka/Redpanda β€’ Full audit trail & replay       β”‚
β”‚  πŸ“š Learning         β†’  Extract skills & policies from successful runs   β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

Why ARIA stands out:

Pillar What it means for you
Vision-First VLM-powered UI understanding with multi-locator fallback β€” no brittle selectors only
Event-Sourced Every step persisted; full audit trail and replay for debugging and compliance
Human-in-the-Loop Safety gates for sensitive actions (login, CAPTCHA, payment) β€” production-safe by design
Local & Bilingual Run entirely on your hardware; native Farsi STT (Whisper) and embeddings
Production-Ready FastAPI + WebSocket API, Streamlit dashboard, Docker Compose, 81 tests

Tech stack: LangGraph β€’ Ollama / OpenAI β€’ Qwen-VL β€’ Playwright β€’ Redpanda (Kafka) β€’ Redis β€’ Qdrant β€’ Mem0

The Job Apply automation (LinkedIn, Indeed) is the first production plugin β€” the platform is built for more.

πŸ”— Explore ARIA β†’ β€’ πŸ“– Docs, ADRs, and MODELS.md inside the repo


πŸš€ Flagship Project 2

πŸ“„ Governance-Safe Financial Document AI

Bilingual (EN/FA) β€’ Quality Gates β€’ Human-in-the-Loop Review β€’ Replayable Lifecycle β€’ Audit Endpoints

Release Python Next.js License FastAPI

InvoiceMind β€” Evidence-first invoice processing

InvoiceMind is not an OCR benchmark or a generic prompt demo β€” it's a production-oriented platform for invoice extraction, human review, and governance-safe automation. Built for teams where traceability and control matter more than blind automation. Most invoice AI fails in production because decisions are hard to trust, explain, and control; InvoiceMind tackles that gap head-on.

End-to-end flow β€” from ingestion to final export, with explicit gates and audit at every step:

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚                    InvoiceMind β€” Pipeline & Lifecycle                    β”‚
β”œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€
β”‚  πŸ“₯ Ingestion β†’ Validation β†’ OCR/Layout β†’ LLM Extraction β†’ Postprocess  β”‚
β”‚  πŸ“Š Routing (quality gates) β†’ Review / Quarantine β†’ Export + Audit        β”‚
β”œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€
β”‚  Run lifecycle: RECEIVED β†’ VALIDATED β†’ EXTRACTED β†’ GATED β†’               β”‚
β”‚                 AUTO_APPROVED | NEEDS_REVIEW β†’ FINALIZED                 β”‚
β”‚  Control: cancel β€’ replay β€’ quarantine (reason-coded) β€’ audit/verify      β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

Why InvoiceMind stands out:

Pillar What it means for you
Evidence-first Policy and gate-based routing instead of confidence-only automation
Decision traceability Every auto-approve or escalate tied to gate results and reason codes
Replayable & auditable Full run lifecycle, cancel/replay, and audit endpoints for compliance and post-incident analysis
Local-first Privacy-first inference; versioned config bundles and model registry (models.yaml)
Safe defaults Quarantine and human review over aggressive auto-posting; NIST AI RMF & OWASP LLM–aligned

Tech stack: Python 3.11+ β€’ FastAPI β€’ Next.js 16 β€’ React 19 β€’ TypeScript β€’ SQLAlchemy β€’ Alembic β€’ SQLite β€’ AGPL-3.0

ADR-001 (local-first), ADR-002 (evidence-first), ADR-003 (policy-driven gates) β€” design documented in the repo.

πŸ”— Explore InvoiceMind β†’ β€’ πŸ“– Docs, run.bat one-click startup, API surface in README


πŸ’Ό Featured Projects

πŸ›‘οΈ DriveShield β€” Real-Time Collision Risk Intelligence

End-to-end collision prediction platform using Nexar's BADAS-Open model.

  • State-of-the-Art Prediction: Real-time risk analysis with vision models
  • 100% Offline: Runs locally without external API calls
  • Production-Ready: FastAPI backend + React TypeScript frontend

Tech: Python β€’ FastAPI β€’ React β€’ TypeScript β€’ PyTorch β€’ Computer Vision

πŸ”— View Repository β†’

πŸ”„ Hybrid Retail Recommender System

Production-ready hybrid recommender combining collaborative filtering & content-based ML.

  • Results: 140% precision improvement, 175% recall improvement
  • Scale: Tested on 38K+ user dataset
  • Bilingual: English/Persian UI with RTL support

Tech: Python β€’ FastAPI β€’ React β€’ TypeScript β€’ scikit-learn β€’ Docker

πŸ”— View Repository β†’

🌊 FlowCast β€” Surge Pricing & ETA Optimization Engine

Enterprise-grade intelligent pricing and ETA prediction for ride-hailing platforms.

  • ETA Accuracy: +20% improvement over baseline
  • Revenue: +10-25% efficiency per trip
  • Price Stability: 30-40% volatility reduction

Tech: Python β€’ FastAPI β€’ React β€’ GeoPandas β€’ Time-Series Forecasting

πŸ”— View Repository β†’

πŸ’Š Pharmaceutical Supply Chain Agentic AI

Four-agent system for supply chain optimization using LangGraph orchestration.

  • Logistics Costs: 40% reduction
  • Stockouts: 67% reduction
  • Forecast Accuracy: 95%+ (MAPE < 5%)

Tech: Python β€’ FastAPI β€’ LangGraph β€’ Next.js β€’ MongoDB β€’ GPT-4o-mini

πŸ”— View Repository β†’

πŸ“š More Projects
Project Description Tech
Blood Cell Cancer Detection CNN-based classifier with 99%+ accuracy TensorFlow β€’ Keras β€’ Medical Imaging
Books Recommendation System Production recommender, 8% sales increase Collaborative Filtering β€’ scikit-learn
Stock Price Collection Automated data pipeline for finance ML Web Scraping β€’ Database Design
CIFAR-10 Classification CNN image classifier, 90%+ accuracy TensorFlow β€’ Keras β€’ CNN

πŸ› οΈ Tech Stack

AI & LLM

OpenAI LangChain LangGraph Ollama RAG Transformers

ML & Data Science

Python PyTorch TensorFlow scikit-learn Pandas

Production & DevOps

FastAPI Docker Kubernetes PostgreSQL Redis

Frontend

React TypeScript Next.js


πŸ† Achievements

Achievement Description
πŸ₯ˆ 2nd Place β€” Tehran Provincial AI Competition (2022)
πŸŽ“ Member β€” Iran's National Elites Foundation
πŸ“œ Kaggle Notebooks Master
πŸ“„ Published Researcher β€” Health Science Reports (Wiley), ICVPR, AMLAI

πŸ“Š GitHub Stats

GitHub Stats GitHub Streak

πŸ“š Publications


πŸ’Ό Experience

Role Company Period
Senior AI/ML Engineer Daria Hamrah Paytakht Jul 2024 – Present
Senior AI/ML Engineer Educational Industries Research & Innovation Co Nov 2023 – Jul 2024
Data Science Team Lead Diar-e Kohan CO. Sep 2020 – May 2022
Data Scientist Diar-e Kohan CO. Sep 2018 – Sep 2020

🎯 Open To

  • πŸš€ Building Agentic AI systems and LLM applications at innovative companies
  • πŸ’Ό Production-grade AI systems that solve real business problems
  • 🌍 Collaborating with international teams on cutting-edge AI/ML projects
  • 🀝 Remote positions, contract work, or full-time opportunities worldwide

Let's Connect

LinkedIn GitHub Kaggle Email


⭐ If you find my work interesting, please consider giving my repositories a star!

Building the future of AI, one system at a time.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors