ModelTC
Repositories
- Qwen-Image-Edit-Causal Public
In our implementation of Qwen-Image-Edit, we employ block causal attention to improve inference speed.
- LightLLM Public
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
- QVGen Public
[ICLR 2026] This is the official PyTorch implementation of "QVGen: Pushing the Limit of Quantized Video Generative Models".
- SageAttention3-sparse Public (forked from thu-ml/SageAttention)
[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized attention that achieves a 2-5x speedup over FlashAttention without losing end-to-end metrics across language, image, and video models.
- VideoAlign Public
- HPSv3 Public