LLMs/MLMs, Qwen-3: Detailed Guide to Qwen3 (Overview, Installation, Usage, and Example Applications)
MLLM, Bench: Detailed Guide to LEGO-Puzzles (Overview, Installation, Usage, and Example Applications)
LLMs, DeepSeek-R1: Using the TinyZero project (Huggingface TRL framework + Countdown-Tasks dataset + Qwen-2.5-3B model) to reproduce DeepSeek R1 Zero…
LLMs, Agent, RL: Detailed Guide to RAGEN (Overview, Installation, Usage, and Example Applications)
LLM, LRMs: "Revisiting Prompt Optimization with Large Reasoning Models-A Case Study on Event Extractio
MLMs, OpenAI o-series: Detailed Guide to OpenAI o3/o4-mini (Overview, Installation, Usage, and Example Applications)
LLMs, RAG: "RAG Agents in Production: 10 Lessons We Learned": Translation and Commentary
LLMs, Agent, A2A: Detailed Guide to A2A (Overview, Installation, Usage, and Example Applications)
LLMs, Agent, A2A: "Announcing the Agent2Agent Protocol (A2A)": Translation and Commentary
MLMs, Benchmark: "InterFeedback: Unveiling Interactive Intelligence of Large Multimodal Models via Hum
LLMs, KGRAG: "MedRAG: Enhancing Retrieval-augmented Generation with Knowledge Graph-Elicited Reasoning
LLMs, Agent: "OctoTools: An Agentic Framework with Extensible Tools for Complex Reasoning": Translation and Commentary
LLMs, RL, CoT: "Monitoring Reasoning Models for Misbehavior and the Risks of Promoting Obfuscation": Translation and…
LLMs, RL, CPPO: "CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning
MLMs, MoE, Chart: "ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding": Translation and…
LLMs, RL, TAO: "TAO: Using test-time compute to train efficient LLMs without labeled data": Translation and Commentary
LLMs, PE: "Tracing the thoughts of a large language model": Translation and Commentary
LLMs, Agent: Detailed Guide to AI-Researcher (Overview, Installation, Usage, and Example Applications)
LLMs, DeepSeek-V3: Detailed Guide to DeepSeek-V3-0324 (Overview, Installation, Usage, and Example Applications)
LLMs, CoTM: "Detecting misbehavior in frontier reasoning models": Translation and Commentary
Design Psychology 2: Living with Complexity