Skip to content
Change the repository type filter

All

    Repositories list

    • mlx-audio

      Public
      A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Sil…
      Python
      MIT License
      562600Updated Apr 24, 2026Apr 24, 2026
    • MOSS-VL

      Public
      MOSS-VL is the core multimodal model series within the OpenMOSS ecosystem, dedicated to visual understanding.
      Python
      Apache License 2.0
      423100Updated Apr 24, 2026Apr 24, 2026
    • Llamascopium

      Public
      Performant framework for training, analyzing and visualizing Sparse Autoencoders (SAEs) and their frontier variants.
      Python
      2821480Updated Apr 20, 2026Apr 20, 2026
    • MOSS-Audio

      Public
      MOSS-Audio is an open-source foundation model for unified audio understanding, enabling speech, sound, music, captioning, QA, and reasoning in real-world scenar…
      Python
      916810Updated Apr 20, 2026Apr 20, 2026
    • JavaScript
      43200Updated Apr 17, 2026Apr 17, 2026
    • MOSS-TTS-Nano

      Public
      MOSS-TTS-Nano is an open-source multilingual tiny speech generation model from MOSI.AI and the OpenMOSS team. With only 0.1B parameters, it is designed for real…
      Python
      Apache License 2.0
      2992.2k324Updated Apr 17, 2026Apr 17, 2026
    • sglang

      Public
      Python
      Apache License 2.0
      0300Updated Apr 14, 2026Apr 14, 2026
    • MOSS-Audio-Tokenizer

      Public
      MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, it supports streaming an…
      Python
      Apache License 2.0
      1319531Updated Apr 13, 2026Apr 13, 2026
    • CSS
      1100Updated Apr 13, 2026Apr 13, 2026
    • MOSS-TTS

      Public
      MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressive…
      Python
      Apache License 2.0
      1541.7k231Updated Apr 13, 2026Apr 13, 2026
    • A real-time video understanding foundation model built on Llama-3.2-Vision, featuring comprehensively extended video processing and multimodal reasoning capabil…
      Python
      Apache License 2.0
      413600Updated Apr 13, 2026Apr 13, 2026
    • Vue
      0500Updated Apr 9, 2026Apr 9, 2026
    • llama.cpp

      Public
      C++
      MIT License
      2302Updated Apr 8, 2026Apr 8, 2026
    • BandPO

      Public
      Official implementation of BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning. BandPO replaces canoni…
      Python
      GNU General Public License v3.0
      44800Updated Apr 8, 2026Apr 8, 2026
    • Python
      0700Updated Apr 3, 2026Apr 3, 2026
    • MOVA

      Public
      MOVA: Towards Scalable and Synchronized Video–Audio Generation
      Python
      Apache License 2.0
      83958271Updated Apr 1, 2026Apr 1, 2026
    • A library for mechanistic interpretability of GPT-style language models
      Python
      MIT License
      555200Updated Mar 31, 2026Mar 31, 2026
    • Embodied-Planner-R1: Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning
      Python
      Apache License 2.0
      12704Updated Mar 30, 2026Mar 30, 2026
    • DiRL

      Public
      Python
      Apache License 2.0
      715801Updated Mar 30, 2026Mar 30, 2026
    • OurClaw

      Public
      Institutional OpenClaw Solution. Share One Claw with Others.
      TypeScript
      MIT License
      32400Updated Mar 30, 2026Mar 30, 2026
    • RoboOmni

      Public
      Official code of "RoboOmni: Proactive Robot Manipulation in Omni-modal Context"
      Python
      510560Updated Mar 28, 2026Mar 28, 2026
    • MOSS-TTSD

      Public
      MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis. It features long-context modeling, flexible speaker control, a…
      Python
      Apache License 2.0
      1261.3k520Updated Mar 23, 2026Mar 23, 2026
    • TTSD-eval

      Public
      Python
      0400Updated Mar 16, 2026Mar 16, 2026
    • JavaScript
      0200Updated Mar 3, 2026Mar 3, 2026
    • Website

      Public
      wangye
      JavaScript
      3001Updated Mar 2, 2026Mar 2, 2026
    • FRoM-W1

      Public
      [ArXiv 26] FRoM-W1: Towards General Humanoid Whole-Body Control with Language Instructions
      Python
      Apache License 2.0
      715430Updated Feb 13, 2026Feb 13, 2026
    • MOSS-Speech is a true speech-to-speech large language model without text guidance.
      Python
      Apache License 2.0
      713020Updated Feb 13, 2026Feb 13, 2026
    • RoboJuDo

      Public
      [ArXiv 26] The Depolyment Framework for the FRoM-W1 Project
      Python
      Other
      63400Updated Jan 28, 2026Jan 28, 2026
    • Python
      12200Updated Jan 22, 2026Jan 22, 2026
    • ABC-Bench

      Public
      ABC-Bench is a benchmark for Agentic Backend Coding. It evaluates whether code agents can explore real repositories, edit code, configure environments, deploy c…
      22910Updated Jan 20, 2026Jan 20, 2026
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.