Artificial Analysis is the leading independent AI benchmarking company. We support labs, engineers and enterprises to understand AI capabilities and make critical decisions about their AI strategies. We are the go-to authority for understanding AI, from AI labs and enterprises to media, investors, and policymakers. Our benchmarks don't just measure the cutting edge of AI, they are actively shaping the frontier.
Our benchmarks and analysis are trusted by hundreds of thousands of users and are the go-to reference for leading AI labs including OpenAI, Google, Meta, NVIDIA and Anthropic, and major publications including the Wall Street Journal, Bloomberg, the Financial Times and The Economist.
We are a team of 35+, on track to triple by year end, backed by Nat Friedman (Github, Meta), Daniel Gross (SSI), Andrew Ng (Google Brain, DeepLearning.ai, Amazon), Adam D'Angelo (Quora, Poe, OpenAI), Clem Delangue (Hugging Face) and other industry leaders.
Our benchmarks and analysis are what the industry turns to when they need to understand AI capabilities, from AI labs and enterprises to media, investors, and policymakers. This role puts you at the forefront of the AI frontier — you won't just observe the cutting edge of AI, your work will define what cutting edge means.
We're hiring Members of Technical Staff to design the evaluations that set the standard for how AI is measured, produce analysis that shapes how companies and the broader industry understand AI, and work directly with the leading AI labs and enterprises who rely on our insights. You'll develop new benchmarking methodologies, manage relationships with some of the most important AI labs and enterprise customers in the world, and help drive the product direction of our platform. The bar for success is becoming a world expert in modern AI technologies.
This is a unique combination of product, research, technical, and client-facing work, suited to highly driven technical generalists who thrive on breadth and ownership. Many of our strongest team members come from top-tier strategy consulting backgrounds, though we hire from a range of disciplines.
You'll work directly with our founders and across the full team. The people who join now will have an outsized role in shaping both our products and the company itself. The coming wave of AI scaling is going to change the world in ways we don't yet understand — and we're offering a front row seat.
AI Benchmarking Product Development: Structure, design and execute projects to evaluate AI systems and technologies, including developing new AI evaluation methodologies and datasets that advance our benchmarking capabilities
Strategic Analysis of AI: Drive developing reports and data visualizations to communicate complex AI concepts to enterprises looking to shape their AI strategy
Partner with Leading AI Companies: Collaborate with leading AI companies to support them in benchmarking their technologies, from agentic AI applications to models to hardware. This also requires identifying opportunities to enhance our leading AI benchmarking platform and working with our developers to make it happen
Become AI-Native: Embrace an AI-native workflow, using cutting-edge AI tools to generate leverage in a fast-changing industry and maintain our competitive edge in AI benchmarking
Company Strategy: We are a startup and are looking for the future leaders of Artificial Analysis. Team members will be expected to contribute to company strategy and should expect to drive large initiatives that will support us in being the leading AI benchmarking company
You have an intense interest in AI, a genuine desire to become a world expert in the field, and strong analytical and coding skills to back it up.
We hire MTS from three backgrounds. You should clearly fit one of these profiles:
Strategy Consulting — backgrounds include Management Consultant, Associate, Engagement Manager, Data Scientist, or similar roles at firms like McKinsey, BCG, Bain, or equivalent. Strong preference will be given to candidates with experience within a data analytics division such as QuantumBlack, AI by McKinsey, BCG X / Gamma or equivalent. You know how to structure ambiguous problems, build analytical frameworks, and communicate findings to senior stakeholders. You're now looking to apply those skills to something deeply technical. The key differentiator: you have a genuine technical interest in AI and ability to code — you follow model releases, you have opinions on where the technology is heading, and you want to be closer to the subject matter than consulting allows.
AI and Machine Learning — backgrounds include ML Engineer, ML Researcher, AI Engineer, Forward Deployed Engineer, Technical PM, Solutions Engineer, Technical Pre-Sales, or similar roles at AI companies or AI-focused teams. You have hands-on experience with modern AI systems and understand how models work at a technical level. You're looking for a role with more breadth and industry exposure than a pure research, engineering, or PM position, where you can combine technical depth with client-facing work and strategic analysis.
Technical Product Management — backgrounds include Founding Engineer, Product Manager, Technical Co-founder, Head of Product, or generalist roles at early-stage AI companies. You've built and shipped AI products in a fast-moving environment where you had to operate across research, engineering, product, and commercial work simultaneously. You understand both how AI systems work technically and how they get brought to market. You're looking for a role where that breadth is the job, not a side effect of being early at a small company.
Across all three profiles, we require:
Strong analytical and critical thinking skills
Proficiency in Python and data analysis
Genuine, demonstrable interest and knowledge of frontier AI — we want people who have informed opinions about where AI is heading, not just people who use AI tools
Shape how AI gets built: The leading AI labs track our benchmarks and use them to guide their development priorities. Your work will directly influence the direction of AI.
Become a world expert in AI: You will evaluate every major model, across every major capability, as they are released. Very few roles offer this breadth of exposure to frontier AI.
Work with the most important players in AI: You'll manage relationships with teams at the leading AI labs and major enterprises as a trusted, independent voice.
Join at a defining moment: We're 35+ people and fast growing, backed by some of the most connected investors in AI. The people who join now will shape the product, the team, and the strategy as we scale.
Competitive compensation including equity
Our team is split across San Francisco, Sydney, and Melbourne