Change the repository type filter
All
Repositories list
53 repositories
verl-tool
PublicVideoScore2
PublicVideoScore
PublicImagenHub
PublicA one-stop library to standardize the inference and evaluation of all the conditional image generation models. [ICLR 2024]EditReward
PublicMMLU-Pro
PublicVisCoder2
Public- This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR 2025]
Pixel-Reasoner
PublicQuickCodec
PublicBrowserAgent
PublicImagenWorld
PublicQuickVideo
PublicQuick Long Video UnderstandingHierarchical-Reasoner
PublicCritique-Coder
PublicVideoEval-Pro
PublicMore reliable Video Understanding EvaluationStructEval
PublicVisCoder
PublicPixelWorld
PublicOne-Shot-CFT
PublicVisualWebInstruct
PublicABC
PublicABC: Achieving Better Control of Multimodal Embeddings using VLMs [TMLR2025]Vamba
PublicTheoremExplainAgent
PublicOfficial Repo for "TheoremExplainAgent: Towards Video-based Multimodal Explanations for LLM Theorem Understanding" [ACL 2025 oral]CritiqueFineTuning
PublicCode for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]ScholarCopilot
PublicMEGA-Bench
PublicThis repo contains the code for "MEGA-Bench Scaling Multimodal Evaluation to over 500 Real-World Tasks" [ICLR 2025]DisProtEdit
PublicVL-Rethinker
Public