I'm Sushil Dalavi, an AI Engineer at the USC Annenberg Norman Lear Center and an MS in Computer Science candidate at USC (2024 β 2026).
I architect production AI systems β AWS data platforms, hybrid retrieval pipelines, distributed LLM workflows, and multi-modal ML β with an emphasis on measurable outcomes, reliability, and reproducibility.
| πΌ | Open to SDE / SWE / AIΒ·ML Engineer / Applied AI roles |
| ποΈ | AWS data platforms, distributed workflows, LLM inference gateways |
| π§ | Hybrid retrieval, reranking, MLOps, multi-modal alignment |
| π | Building JobSense, ScribeAI, and ScholarRAG |
| π | Motivated by real-world product impact |
| β½ | Proud Real Madrid supporter |
| π₯ | Huge anime geek |
USC Annenberg Norman Lear Center AI Engineer π Los Angeles, CA Β |Β π Jun 2025 β Present |
Reliance Jio Platforms Software Engineer π Navi Mumbai, India Β |Β π Dec 2023 β Jul 2024 |
π Highlights from USC Annenberg Norman Lear Center
- Architected an AWS data platform (S3, Glue, SageMaker, Bedrock) ingesting, deduplicating, and normalizing 1M+ multi-region records for downstream ML training and retrieval workloads.
- Shipped a multi-modal alignment system fusing audio, speaker diarization, and caption streams β reaching 99.3% F1 and 99.9% coverage on ground-truth evaluation.
- Developed large-scale batch pipelines processing long-form video and audio through Whisper ASR, pyannote diarization, and model-based refinement stages.
- Automated dataset QA, Unicode normalization, and deduplication in Python β lifting analysis-ready yield from 10,819 β 9,735 records with full reproducibility.
π Highlights from Reliance Jio Platforms
- Trained and deployed ResNet-50 and DenseNet-121 deep vision networks for medical image anomaly detection β improving recall by 35% via transfer learning, augmentation, and loss tuning.
- Optimized quantized transformer inference (BERT, GPT-2) on GPU with batched serving β cutting p95 latency by 30% while preserving accuracy gains.
- Engineered demand-forecasting microservices (TFT, CatBoost, LSTM) over Hive SQL batch pipelines, reducing forecast MAPE by 25% for business-critical workloads.
- Rolled out shadow-testing and canary-release workflows for 3 production ML upgrades, catching 2 latency regressions before fleet-wide deployment.
π§ JobSenseDurable distributed workflow platform β a fault-tolerant orchestration system on Temporal with 12 tool integrations, human-in-the-loop checkpoints, and a provider-agnostic inference gateway. Highlights
Stack |
βοΈ ScribeAIInference service with evaluation pipeline β async FastAPI service with SSE streaming, multi-backend routing (GPT-4o, Claude, fallback), and an MLflow-tracked evaluation harness. Highlights
Stack |
π ScholarRAGRetrieval and data engineering system β a hybrid retrieval pipeline for scholarly discovery with citation-aware grounding. Highlights
Stack |
π₯ MedSOAPClinical documentation automation β generates structured SOAP notes from doctor-patient conversations. Highlights
Stack |
| π¬ | Love webseries and serious binge watching | π | Swimming keeps me grounded |
| π | Enjoy table tennis | β½ | Lifelong football fan |
| π₯ | Huge anime geek | π§ | Music always around |
A proud Real Madrid supporter β I love the mentality, the standards, the legacy, and the winning culture.
I'm especially interested in opportunities where strong software engineering meets AI/ML, backend systems, and data-driven product building.






