-
Qwen/Qwen2.5-Coder-7B-Instruct
Text Generation • 8B • Updated • 473k • • 586 -
meta-llama/Llama-3.1-8B-Instruct
Text Generation • 8B • Updated • 11.4M • • 5.22k -
Toolformer: Language Models Can Teach Themselves to Use Tools
Paper • 2302.04761 • Published • 12 -
ReAct: Synergizing Reasoning and Acting in Language Models
Paper • 2210.03629 • Published • 31
Collections
Discover the best community collections!
Collections including paper arxiv:2302.04761
-
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs
Paper • 2307.16789 • Published • 101 -
Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models
Paper • 2308.00675 • Published • 36 -
Toolformer: Language Models Can Teach Themselves to Use Tools
Paper • 2302.04761 • Published • 12 -
GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction
Paper • 2305.18752 • Published • 4
-
Neural Machine Translation by Jointly Learning to Align and Translate
Paper • 1409.0473 • Published • 7 -
Attention Is All You Need
Paper • 1706.03762 • Published • 108 -
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Paper • 1810.04805 • Published • 25 -
Hierarchical Reasoning Model
Paper • 2506.21734 • Published • 46
-
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
Paper • 2503.12605 • Published • 35 -
AppAgentX: Evolving GUI Agents as Proficient Smartphone Users
Paper • 2503.02268 • Published • 11 -
Machine Learning Operations (MLOps): Overview, Definition, and Architecture
Paper • 2205.02302 • Published • 1 -
Beyond Browsing: API-Based Web Agents
Paper • 2410.16464 • Published • 2
-
Will we run out of data? An analysis of the limits of scaling datasets in Machine Learning
Paper • 2211.04325 • Published • 1 -
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Paper • 1810.04805 • Published • 25 -
On the Opportunities and Risks of Foundation Models
Paper • 2108.07258 • Published • 2 -
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks
Paper • 2204.07705 • Published • 2
-
Qwen/Qwen2.5-Coder-7B-Instruct
Text Generation • 8B • Updated • 473k • • 586 -
meta-llama/Llama-3.1-8B-Instruct
Text Generation • 8B • Updated • 11.4M • • 5.22k -
Toolformer: Language Models Can Teach Themselves to Use Tools
Paper • 2302.04761 • Published • 12 -
ReAct: Synergizing Reasoning and Acting in Language Models
Paper • 2210.03629 • Published • 31
-
Neural Machine Translation by Jointly Learning to Align and Translate
Paper • 1409.0473 • Published • 7 -
Attention Is All You Need
Paper • 1706.03762 • Published • 108 -
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Paper • 1810.04805 • Published • 25 -
Hierarchical Reasoning Model
Paper • 2506.21734 • Published • 46
-
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
Paper • 2503.12605 • Published • 35 -
AppAgentX: Evolving GUI Agents as Proficient Smartphone Users
Paper • 2503.02268 • Published • 11 -
Machine Learning Operations (MLOps): Overview, Definition, and Architecture
Paper • 2205.02302 • Published • 1 -
Beyond Browsing: API-Based Web Agents
Paper • 2410.16464 • Published • 2
-
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs
Paper • 2307.16789 • Published • 101 -
Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models
Paper • 2308.00675 • Published • 36 -
Toolformer: Language Models Can Teach Themselves to Use Tools
Paper • 2302.04761 • Published • 12 -
GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction
Paper • 2305.18752 • Published • 4
-
Will we run out of data? An analysis of the limits of scaling datasets in Machine Learning
Paper • 2211.04325 • Published • 1 -
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Paper • 1810.04805 • Published • 25 -
On the Opportunities and Risks of Foundation Models
Paper • 2108.07258 • Published • 2 -
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks
Paper • 2204.07705 • Published • 2