[LLM] DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
[LLM] DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning paper: https://arxiv.org/pdf/2501.12948 github: https://github....
[LLM] DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning paper: https://arxiv.org/pdf/2501.12948 github: https://github....
[LLM] DeepSeek-V3 Technical Report
[MM] Cambrian-1: A Fully Open,Vision-Centric Exploration of Multimodal LLMs
[MM] InternVL-2.5-MPO: Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization
[MM] InternVL-2.5: Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling