sushildalavi/README.md


About Me


I'm Sushil Dalavi, an AI Engineer at the USC Annenberg Norman Lear Center and an MS in Computer Science candidate at USC (2024 – 2026).

I architect production AI systems (AWS data platforms, hybrid retrieval pipelines, distributed LLM workflows, and multi-modal ML) with an emphasis on measurable outcomes, reliability, and reproducibility.


💼 Open to SDE / SWE / AI·ML Engineer / Applied AI roles
🏗️ AWS data platforms, distributed workflows, LLM inference gateways
🧠 Hybrid retrieval, reranking, MLOps, multi-modal alignment
📚 Building JobSense, ScribeAI, and ScholarRAG
🌍 Motivated by real-world product impact
⚽ Proud Real Madrid supporter
🍥 Huge anime geek


🎓 Education


University of Southern California
MS in Computer Science
📍 Los Angeles, CA | 📅 Aug 2024 – May 2026




University of Mumbai
BE in Computer Engineering
📍 Mumbai, India | 📅 Jun 2019 – May 2023




💼 Work Experience


USC Annenberg Norman Lear Center
AI Engineer
📍 Los Angeles, CA | 📅 Jun 2025 – Present




Reliance Jio Platforms
Software Engineer
📍 Navi Mumbai, India | 📅 Dec 2023 – Jul 2024



📌 Highlights from USC Annenberg Norman Lear Center
  • Architected an AWS data platform (S3, Glue, SageMaker, Bedrock) ingesting, deduplicating, and normalizing 1M+ multi-region records for downstream ML training and retrieval workloads.
  • Shipped a multi-modal alignment system fusing audio, speaker diarization, and caption streams, reaching 99.3% F1 and 99.9% coverage on ground-truth evaluation.
  • Developed large-scale batch pipelines processing long-form video and audio through Whisper ASR, pyannote diarization, and model-based refinement stages.
  • Automated dataset QA, Unicode normalization, and deduplication in Python, narrowing the dataset from 10,819 to 9,735 analysis-ready records with full reproducibility.
📌 Highlights from Reliance Jio Platforms
  • Trained and deployed ResNet-50 and DenseNet-121 deep vision networks for medical image anomaly detection, improving recall by 35% via transfer learning, augmentation, and loss tuning.
  • Optimized quantized transformer inference (BERT, GPT-2) on GPU with batched serving, cutting p95 latency by 30% while preserving accuracy.
  • Engineered demand-forecasting microservices (TFT, CatBoost, LSTM) over Hive SQL batch pipelines, reducing forecast MAPE by 25% for business-critical workloads.
  • Rolled out shadow-testing and canary-release workflows for 3 production ML upgrades, catching 2 latency regressions before fleet-wide deployment.


🛠️ Tech Stack

Languages

ML & Deep Learning

LLMs & Retrieval

Backend & Data Systems

Cloud & DevOps

AI-Assisted Development


🚀 Featured Projects

🧭 JobSense

Durable distributed workflow platform: a fault-tolerant orchestration system on Temporal with 12 tool integrations, human-in-the-loop checkpoints, and a provider-agnostic inference gateway.

Highlights

  • Temporal-based orchestration with automated retries & end-to-end observability
  • Provider-agnostic inference gateway with multi-backend failover & Redis semantic caching
  • CI regression gates blocking merges on quality or cost drift
  • Hybrid retrieval (BM25 + dense + cross-encoder) fused with Reciprocal Rank Fusion
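For readers unfamiliar with Reciprocal Rank Fusion, here is a minimal sketch of how ranked lists from several retrievers can be merged. It is an illustration only, not JobSense's actual code: the function name, the sample document IDs, and the constant `k=60` (a commonly used default) are all assumptions.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into a single ranking.

    Each document scores 1 / (k + rank) per list it appears in (rank is 1-based);
    the per-list scores are summed. k=60 is an assumed, commonly used default.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties are broken by doc_id for determinism.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical results from a lexical (BM25) and a dense retriever.
bm25_hits = ["d3", "d1", "d2"]
dense_hits = ["d1", "d4", "d3"]

fused = reciprocal_rank_fusion([bm25_hits, dense_hits])
print(fused)  # ['d1', 'd3', 'd4', 'd2']
```

Because RRF uses only ranks, it avoids having to calibrate BM25 scores against dense similarity scores; a cross-encoder would then rerank the top of the fused list.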

Stack

✍️ ScribeAI

Inference service with evaluation pipeline: async FastAPI service with SSE streaming, multi-backend routing (GPT-4o, Claude, fallback), and an MLflow-tracked evaluation harness.

Highlights

  • Graceful degradation under upstream failure across multiple LLM backends
  • MLflow-tracked evaluation: ROUGE, BLEU, BERTScore, faithfulness, leakage checks
  • Compliance-aware pipeline: 10+ PII types redacted, pgcrypto storage, append-only audit log
  • Automated regression alerts on metric drift across versioned releases

Stack

📘 ScholarRAG

Retrieval and data engineering system: a hybrid retrieval pipeline for scholarly discovery with citation-aware grounding.

Highlights

  • Dense + BM25 + RRF + MiniLM rerank lifting MRR by 21.8% and nDCG@10 by 18.0% over a 120+ query eval harness
  • Duplicate indexing reduced by 50%, re-ingestion time by 60% via DOI/ID/title normalization + SHA-256 content hashing
  • Answer grounding lifted from 0.505 → 0.616 faithfulness; claim support 45.4% → 85.6%
  • Evidence-constrained generation with citation-aware prompting across heterogeneous scholarly sources
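The deduplication step mentioned above (identifier/title normalization plus SHA-256 content hashing) could look roughly like the sketch below. This is a hedged illustration rather than ScholarRAG's implementation: the record fields (`doi`, `title`), the helper names, and the exact normalization rules are assumptions.

```python
import hashlib
import re
import unicodedata


def normalize_text(text):
    """Unicode-normalize, case-fold, and collapse whitespace."""
    text = unicodedata.normalize("NFKC", text).casefold()
    return re.sub(r"\s+", " ", text).strip()


def content_key(record):
    """Build a stable SHA-256 key from the DOI if present, else from the title."""
    doi = record.get("doi")
    if doi:
        # Strip common DOI prefixes so URL and bare forms hash identically.
        bare = re.sub(r"^(https?://(dx\.)?doi\.org/|doi:)", "", doi.strip().lower())
        basis = "doi:" + bare
    else:
        basis = "title:" + normalize_text(record["title"])
    return hashlib.sha256(basis.encode("utf-8")).hexdigest()


def deduplicate(records):
    """Keep the first record seen for each content key, preserving order."""
    seen = set()
    unique = []
    for record in records:
        key = content_key(record)
        if key not in seen:
            seen.add(key)
            unique.append(record)
    return unique


# Hypothetical records: two spellings of one DOI, two spellings of one title.
papers = [
    {"doi": "https://doi.org/10.1000/ABC", "title": "Hybrid Retrieval"},
    {"doi": "10.1000/abc", "title": "Hybrid retrieval (preprint)"},
    {"doi": None, "title": "Dense  Passage\u00a0Retrieval"},
    {"doi": None, "title": "dense passage retrieval"},
]
unique_papers = deduplicate(papers)
print(len(unique_papers))  # 2
```

Storing the hash alongside each indexed document also lets a re-ingestion run skip records whose key is already present.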

Stack

🏥 MedSOAP

Clinical documentation automation: generates structured SOAP notes from doctor-patient conversations.

Highlights

  • LLM-driven SOAP note generation with medical entity recognition
  • HIPAA-conscious architecture with audit trails
  • Fine-tuning and evaluation pipeline for clinical summarization
  • Explores healthcare-focused product design patterns

Stack


📊 GitHub Analytics

GitHub Stats | Top Languages

GitHub Streak

Activity Graph


πŸ† Trophies

GitHub Trophies


💬 Quote I Live By

- Aristotle


🎯 Beyond the Code

🎬 Love web series and serious binge watching
🏊 Swimming keeps me grounded
🏓 Enjoy table tennis
⚽ Lifelong football fan
🍥 Huge anime geek
🎧 Music always around

⚽ Hala Madrid

A proud Real Madrid supporter: I love the mentality, the standards, the legacy, and the winning culture.


🎧 Spotify


🔍 What I'm Looking For

I'm especially interested in opportunities where strong software engineering meets AI/ML, backend systems, and data-driven product building.


🤝 Let's Connect

built with love | powered by coffee

Pinned

  1. nanoserve (Public)

    OpenAI-compatible LLM serving engine built from scratch on Apple Silicon. Continuous batching, paged KV-cache, prefix caching, INT8/INT4 quantization, Prometheus/Grafana observability. Benchmarked …

    Python

  2. SOAPFlow (Public)

    AI clinical scribe: converts doctor-patient transcripts to structured SOAP notes. Six generation backends (OpenAI, Anthropic, Groq, Ollama, MLX LoRA, rule-based), PHI de-identification, vector sea…

    Python

  3. sourcery (Public)

    Scholarly RAG app with hybrid retrieval, sentence-level citation grounding, and calibrated per-claim confidence scoring. FastAPI + React + Postgres/pgvector + Ollama. Brier 0.160, AUC 0.852 on held…

    Python

  4. QueryLens (Public)

    PostgreSQL query performance monitor. Collects pg_stat_statements telemetry, fingerprints SQL, snapshots EXPLAIN plans, detects regressions deterministically, and surfaces slow queries in a React d…

    Python

  5. ReplayForge (Public)

    Async workflow replay and failure-debugging platform. Python/FastAPI control plane, Go data plane, Redis Streams consumer groups, dead-letter queue, exponential backoff, and a React timeline dashbo…

    Python

  6. SchemaPilot (Public)

    API contract drift monitor. Polls live JSON endpoints, infers schemas from observed payloads, detects breaking changes deterministically, and optionally generates LLM changelogs over structured diffs.

    Python