A-Mahla (Amir Mahla)

upvoted 3 articles 3 months ago

Article

Continuous batching from first principles

+1

ror, ArthurZ, mcpotato

•

Nov 25, 2025

• 380

Article

Mixture of Experts (MoEs) in Transformers

+5

ariG23498, pcuenq, merve, IlyasMoutawwakil, ArthurZ, sergiopaniego, Molbap

•

Feb 26

• 159

Article

I Let a Lobster Run My Jetson: What OpenClaw Taught Me About the Future of Computing

andito

•

Feb 19

• 16

upvoted a paper 4 months ago

CaMeLs Can Use Computers Too: System-level Security for Computer Use Agents

Paper • 2601.09923 • Published Jan 14 • 5

upvoted an article 5 months ago

Article

cua-bench: A Framework for Benchmarking, Training Data, and RL Environments for Computer-Use Agents

cua-ai

•

Dec 16, 2025

• 12

upvoted a paper 6 months ago

Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents

Paper • 2510.24702 • Published Oct 28, 2025 • 31

upvoted an article 7 months ago

Article

Building the Open Agent Ecosystem Together: Introducing OpenEnv

+8

spisakjo, darktex, zkwentz, mortimerp9, Sanyam, Hamid-Nazeri, Pankit01, emre0, lewtun, reach-vb

•

Oct 23, 2025

• 162

upvoted 4 papers 7 months ago

upvoted 3 articles 8 months ago

Article

Gaia2 and ARE: Empowering the community to study agents

+9

clefourrier, gregmialz, mlcu, mortimerp9, XciD, tfrere, evijit, RomainFroger, dheeraj7596, CarolinePascal, upiter

•

Sep 22, 2025

• 134

Article

PrediBench: Testing AI models on prediction markets

charles-azam

•

Sep 24, 2025

• 5

Article

Smol2Operator: Post-Training GUI Agents for Computer Use

+3

A-Mahla, merve, sergiopaniego, reach-vb, lewtun

•

Sep 23, 2025

• 138

upvoted a paper 8 months ago

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2, 2025 • 127

upvoted an article 8 months ago

Article

Exploring Environments Hub: Your Language Model needs better (open) environments to learn

anakin87

•

Sep 4, 2025

• 30

upvoted an article 10 months ago

Article

ScreenEnv: Deploy your full stack Desktop Agent

A-Mahla, m-ric

•

Jul 10, 2025

• 76

upvoted a paper 11 months ago

Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge

Paper • 2506.21506 • Published Jun 26, 2025 • 52

upvoted 2 articles 11 months ago

Article

🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation

moonshotai

•

Jun 21, 2025

• 77

Article

ScreenSuite - The most comprehensive evaluation suite for GUI Agents!

+1

A-Mahla, m-ric, thomwolf

•

Jun 6, 2025

• 56

Amir Mahla

AI & ML interests

Organizations

Continuous batching from first principles

Mixture of Experts (MoEs) in Transformers

I Let a Lobster Run My Jetson: What OpenClaw Taught Me About the Future of Computing

CaMeLs Can Use Computers Too: System-level Security for Computer Use Agents

cua-bench: A Framework for Benchmarking, Training Data, and RL Environments for Computer-Use Agents

Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents

Building the Open Agent Ecosystem Together: Introducing OpenEnv

FineVision: Open Data Is All You Need

ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data

UI-Venus Technical Report: Building High-performance UI Agents with RFT

Robot Learning: A Tutorial

Gaia2 and ARE: Empowering the community to study agents

PrediBench: Testing AI models on prediction markets

Smol2Operator: Post-Training GUI Agents for Computer Use

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Exploring Environments Hub: Your Language Model needs better (open) environments to learn

ScreenEnv: Deploy your full stack Desktop Agent

Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge

🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation

ScreenSuite - The most comprehensive evaluation suite for GUI Agents!

Amir Mahla

AI & ML interests

Organizations

A-Mahla's activity

Continuous batching from first principles

Mixture of Experts (MoEs) in Transformers

I Let a Lobster Run My Jetson: What OpenClaw Taught Me About the Future of Computing

cua-bench: A Framework for Benchmarking, Training Data, and RL Environments for Computer-Use Agents

Building the Open Agent Ecosystem Together: Introducing OpenEnv

Gaia2 and ARE: Empowering the community to study agents

PrediBench: Testing AI models on prediction markets

Smol2Operator: Post-Training GUI Agents for Computer Use

Exploring Environments Hub: Your Language Model needs better (open) environments to learn

ScreenEnv: Deploy your full stack Desktop Agent

🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation

ScreenSuite - The most comprehensive evaluation suite for GUI Agents!