Tech hiring evolves as candidates ask for AI compute alongside pay and perks

Skye Jacobs

Posts: 1,985   +58
Staff
Winners & losers: Job candidates at top tech firms are beginning to look for something new alongside salary, equity, and bonuses: access to artificial intelligence compute. Engineers interviewing at companies like OpenAI and other AI-focused startups now want to know not just what they'll earn, but also how much GPU and inference capacity they will have to work with. The shift – treating AI compute as personal working capital – is starting to influence how some companies think about talent acquisition, productivity, and budgets.

Inference has quietly become one of the most valuable resources inside software companies. Once just a line buried in cloud bills, it is now treated as a unit of power – one that can determine how fast engineers build products and which experiments actually get off the ground.

OpenAI engineer Thibault Sottiaux recently noted on X that candidates frequently ask how much "dedicated inference compute" they would receive if they joined his team. This question reflects a deeper reality: usage per user is growing faster than overall user growth, suggesting that compute supply is tightening even as demand soars.

That scarcity is creating a hierarchy of access within technical organizations. Teams with access to GPUs or high-performance inference budgets move faster and ship more than those left waiting in the queue.

OpenAI President Greg Brockman put it bluntly: the amount of inference compute available to an engineer increasingly determines software productivity. Inside labs like OpenAI and Anthropic, compute allocation is now treated almost like budget approval – an internal currency traded among priorities and projects.

Outside those labs, investors are taking notice. Theory Ventures partner Tomasz Tunguz says inference access is effectively becoming a fourth pillar of engineering compensation, alongside cash, equity, and bonuses. He predicts engineers will soon negotiate "token budgets" alongside pay.

Tokens are the numerical building blocks that models use to process data – one token equals roughly three-quarters of a word – and inference platforms sell access by the million tokens. The more tokens allocated to a developer or project, the more work the AI can perform.

The shift is pushing finance teams to think differently about compensation. Tunguz estimates that for a senior engineer earning $375,000, adding $100,000 in annual inference usage increases the total employment cost by roughly a fifth. These costs, historically invisible, are now appearing as recurring expenses directly tied to productivity.

"It is starting to happen," Tunguz told Business Insider, as employee use of AI increasingly contributes to company cash burn and draws scrutiny from CFOs.

The logic extends to output as well. If cloud infrastructure performance can be measured as profit per GPU hour, an engineer's metric might soon be productive work per dollar of inference. Tunguz, who uses AI tools to automate 31 tasks a day, says his personal annual inference cost sits around $12,000. Scale that across teams deeply embedded in generative AI workflows, and usage becomes more than a budgetary concern – it's part of the job.

Peter Gostev, who leads AI capability at model-performance startup Arena, has proposed one idea to make this transparent: job listings that advertise not only salary ranges but also token budgets. These, he says, would give candidates a clear picture of the compute access they can expect before joining. Inference access, once a technical footnote, is now becoming a recruiting tool.

Permalink to story:

 
And let's not forget about how heavoly subsidized AI is right now because anyone with a data center is in the "custome acquisition" part of the process. AI is the most energy.intensive thing most people do with compute. Whether it's generating matter for gooners to fap to or if you're automating your billing receipts into a spreadsheet. This is all financed with debt. AI costs are going to go up, not down, as companies have to start to pay to maintain their debt and their hardware will likely be obsolete before that debt is paid off.

Also, I thought it takes 4 tokens to generate 1 word as AI typically generated 4 outcomes and then picks the next one before moving onto the next word.
 
Hold up....

The tools they need to do their job are considered employment compensation?
Considering that my healthcare in the US is considered as employment compensation and I still partially pay tax on it, yes. Welcome to the US.
 
Token budgets are the technologically illiterate way to measure usage. Tokens are not equal across models. One may use twice as many as another to accomplish the same thing by nature of model architecture, system prompt, user prompt, and any other tuning applied that may be invisible to end users.

Measure in actual inference time and compute needed. Token counting is for bean counting managers that don't understand tech, not tech workers who might.
 
Hold up....

The tools they need to do their job are considered employment compensation?
Considering that my healthcare in the US is considered as employment compensation and I still partially pay tax on it, yes. Welcome to the US.
No, read the article again:

"Tunguz, who uses AI tools to automate 31 tasks a day, says his personal annual inference cost sits around $12,000."

The keyword here is "personal". This guy is so addicted to AI search results that he is racking up tens of thousands worth of usage charges, somehow, and wants his employer to pay for it since he is an AI guru who works with AI and knows so much about AI. And they made a whole article about it.

This is no different then the NFT bros who got compensated with NFTs and crypto currency that ended up worthless years later.
 
No, read the article again:

"Tunguz, who uses AI tools to automate 31 tasks a day, says his personal annual inference cost sits around $12,000."

The keyword here is "personal". This guy is so addicted to AI search results that he is racking up tens of thousands worth of usage charges, somehow, and wants his employer to pay for it since he is an AI guru who works with AI and knows so much about AI. And they made a whole article about it.

This is no different then the NFT bros who got compensated with NFTs and crypto currency that ended up worthless years later.
the companies are begging us to use AI, this guy is doing that. he isn't making tiktok videos, he is automating his job per what every layoff hungry company wants right now
 
Back