    Repositories list

    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      12k0011 Updated Jan 6, 2026
    • aibrix

      Public
      Cost-efficient and pluggable Infrastructure components for GenAI inference
      Jupyter Notebook
      5060013 Updated Jan 5, 2026
    • production-stack

      Public
      vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization
      Python
      349003 Updated Jan 5, 2026
    • gpucost

      Public
      Python
      0000 Updated Dec 3, 2025
    • evals

      Public
      TypeScript
      0001 Updated Nov 6, 2025
    • TypeScript
      0002 Updated Oct 21, 2025
    • Shell
      2000 Updated Sep 29, 2025
    • dynamo

      Public
      A Datacenter Scale Distributed Inference Serving Framework
      Rust
      764006 Updated Sep 24, 2025
    • Objective-C
      0100 Updated Sep 18, 2025
    • Python
      4000 Updated Sep 16, 2025
    • EAGLE

      Public
      Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3.
      Python
      238002 Updated Aug 6, 2025
    • Training an Eagle3 speculative decoding model based on MiniCPM4
      Python
      1003 Updated Aug 6, 2025
    • LMCache

      Public
      Redis for LLMs
      Python
      840001 Updated Jul 29, 2025
    • Swift
      858000 Updated Feb 12, 2025
    • Cookbook

      Public
      MDX
      165932 Updated Dec 30, 2024
    • langfuse

      Public
      🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
      TypeScript
      2k000 Updated Dec 23, 2024
    • A type-safe, Swift-language layer over SQLite3.
      C
      1.6k001 Updated Oct 28, 2024
    • A declarative library for application development using cloud services.
      Swift
      225006 Updated Oct 19, 2024
    • Swift SDK for Clickstream Analytics on AWS
      Swift
      2000 Updated Jul 28, 2024
    • TypeScript
      1302 Updated Oct 25, 2023