Skip to content
Change the repository type filter

All

    Repositories list

    • DOMINO

      Public
      Towards Generalizable Robotic Manipulation in Dynamic Environments
      Python
      Apache License 2.0
      818420Updated Apr 22, 2026Apr 22, 2026
    • NUMINA

      Public
      [CVPR 2026] When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models
      Python
      MIT License
      66300Updated Apr 11, 2026Apr 11, 2026
    • HyDRA

      Public
      Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models
      Python
      1323521Updated Apr 10, 2026Apr 10, 2026
    • VEGA-3D

      Public
      Official code of "Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding"
      Python
      Apache License 2.0
      2042200Updated Apr 9, 2026Apr 9, 2026
    • PointTPA

      Public
      [CVPR 2026] PointTPA: Dynamic Network Parameter Adaptation for 3D Scene Understanding
      Python
      MIT License
      12410Updated Apr 7, 2026Apr 7, 2026
    • Python
      MIT License
      0000Updated Mar 29, 2026Mar 29, 2026
    • PointGST

      Public
      [IEEE TPAMI] Parameter-Efficient Fine-Tuning in Spectral Domain for Point Cloud Learning
      Python
      Apache License 2.0
      35100Updated Mar 24, 2026Mar 24, 2026
    • MindDrive

      Public
      Official code of “MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning”
      Python
      Apache License 2.0
      18100Updated Feb 12, 2026Feb 12, 2026
    • .github

      Public
      readme
      0000Updated Feb 5, 2026Feb 5, 2026
    • NAUTILUS

      Public
      [NeurIPS 2025] NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding
      Python
      2836230Updated Dec 18, 2025Dec 18, 2025
    • GRANT

      Public
      [AAAI 2026 Oral] Cook and Clean Together: Teaching Embodied Agents for Parallel Task Execution
      Python
      Apache License 2.0
      1236200Updated Dec 12, 2025Dec 12, 2025
    • MERGE

      Public
      [NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models
      Python
      Apache License 2.0
      1821600Updated Oct 31, 2025Oct 31, 2025
    • EasyCache

      Public
      Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Caching
      Python
      Apache License 2.0
      429010Updated Aug 29, 2025Aug 29, 2025
    • Collect some World Models for Autonomous Driving (and Robotic) papers.
      79000Updated Jul 14, 2025Jul 14, 2025
    • HERMES

      Public
      [ICCV 2025] HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation
      Python
      Apache License 2.0
      12000Updated Jul 13, 2025Jul 13, 2025
    • Orion

      Public
      [ICCV 2025] Official code of "ORION: A Holistic End-to-End Autonomous Driving Framework by Vision-Language Instructed Action Generation"
      Python
      Apache License 2.0
      67200Updated Jun 26, 2025Jun 26, 2025
    • [NeurIPS 2024] PointMamba: A Simple State Space Model for Point Cloud Analysis
      Python
      Apache License 2.0
      41000Updated Mar 19, 2025Mar 19, 2025
    • UniSeg3D

      Public
      [NeurIPS 2024] A Unified Framework for 3D Scene Understanding
      Python
      Apache License 2.0
      9000Updated Nov 28, 2024Nov 28, 2024
    • DAPT

      Public
      [CVPR 2024] Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis
      Python
      Apache License 2.0
      8000Updated Oct 11, 2024Oct 11, 2024
    • SAM3D

      Public
      [SCIS] SAM3D: Zero-Shot 3D Object Detection via Segment Anything Model
      Python
      14000Updated Jan 28, 2024Jan 28, 2024
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.