cs undergrad · currently exploring reinforcement learning
blogs · paper summaries · x
writing about things i understand, summarizing papers i like, and always happy to connect — pradheepraop [at] gmail [dot] com
cs undergrad · currently exploring reinforcement learning
blogs · paper summaries · x
writing about things i understand, summarizing papers i like, and always happy to connect — pradheepraop [at] gmail [dot] com
An adaptive model selection system for efficient on-device NLP inference, enhancing speed, privacy, and resource use on edge devices.
TypeScript 1
Hybrid AI framework combining reinforcement learning and genetic algorithms to optimize Formula 1 front wing aerodynamic designs. Features neural network-guided optimization, CFD analysis, structur…
Energy-aware DDPG RL framework that dynamically optimizes LLM speculative decoding parameters based on real-time hardware metrics (CPU/GPU temps, battery). Maintains 95-98% energy utilization to ma…
Python
Intelligent agent that converts natural language prompts into production-ready multi-file codebases with automatic dependency resolution, Docker validation, and iterative error correction.
Multi-Teacher Monte Carlo Tree Search (MT-MCTS) for code generation, where 3+ diverse teacher models (DeepSeek-Coder, CodeLlama, Qwen2.5-Coder) propose token paths that form the MCTS search tree. T…
Python 1