Microsoft starts canceling Claude Code licenses

hckrnws

(theverge.com)

452

441

by robertkarl

https://archive.ph/WfCta

harimau777
7314h
The comments I see recommending selective use of cheaper models don't match the reality I experience working in the industry. I have the constant threat hanging over my head of being fired if I don't churn out code quickly enough. I'm not willing to gamble with my livelihood by using a less effective model.
Saving money on tokens isn't something that's rewarded during performance reviews; particularly because it's difficult to quantify how much you saved versus hypothetically using a more expensive model.
1. bob1029
  1211h
  I think quantifying tokens used is analogous to quantifying the amount of sawdust generated on a construction site.
  Churning out useful code quickly is not solved by using more tokens per unit time. Most non-technical leaders can grasp this one and are likely more interested in the strategic game theoretical dynamics that are being forced by way of implied token consumption expectations (competition between developers).
  If you want to hold out as long as possible and don't really care about anything other than the compensation package, you should at least play along with this new game in a half-assed manner. Try to goldilocks your token usage between any established extremes. You want to be in the statistical barycenter of every AI report that management can create.
  1. theptip
    610h
    To understand the token count thing - spending tokens is necessary and not sufficient to demonstrate that you are adopting AI.
    Where we were 6mo ago is that a lot of big orgs realized they were behind, and needed some way of measuring if the tools were usable at all.
    No sawdust at all on your job site, and you can tell nobody is cutting wood.
    Now that tooling is more mature, you can measure things like % of diffs AI-generated, % of AI suggestions accepted vs edited, % of KB queries successful etc - all more useful than raw token count for quantifying how your org is using the tool.
    So it’s a pragmatic metric that got a bit Goodharted.
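A rough sketch of how the metrics listed above (share of diff lines that are AI-generated, share of suggestions accepted, share accepted without edits) could be computed. The `Diff` and `Suggestion` records and their fields are hypothetical; nothing here reflects how any particular vendor dashboard attributes lines or counts suggestions.

```python
from dataclasses import dataclass


@dataclass
class Diff:
    lines_total: int
    lines_ai: int  # lines attributed to an AI tool (hypothetical attribution)


@dataclass
class Suggestion:
    accepted: bool
    edited: bool  # True if the suggestion was accepted but then modified


def adoption_metrics(diffs, suggestions):
    """Return percentage metrics; each is 0.0 when its denominator is empty."""
    total_lines = sum(d.lines_total for d in diffs)
    ai_lines = sum(d.lines_ai for d in diffs)
    accepted = [s for s in suggestions if s.accepted]
    unedited = [s for s in accepted if not s.edited]
    return {
        "pct_lines_ai": 100 * ai_lines / total_lines if total_lines else 0.0,
        "pct_accepted": 100 * len(accepted) / len(suggestions) if suggestions else 0.0,
        "pct_accepted_unedited": 100 * len(unedited) / len(accepted) if accepted else 0.0,
    }
```

Unlike a raw token count, each of these ratios has a denominator tied to actual work product, which is the distinction the comment is drawing.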
2. gchamonlive
  711h
  > I have the constant threat hanging over my head of being fired if I don't churn out code quickly enough.
  And the tragedy is that this isn't sustainable, and we all involved deeply in tech know this. There is eventually going to be a big reality check the companies will have to pay, because you can't force creativity and quality, not even with AI, because actual intelligence lies with us at least for now and for the foreseeable future. However when the rope eventually snaps these executives at best will fall upwards, with big severance bonuses and a list of "contributions" we have to be grateful for. We are the ones that will suffer through the next big layoffs.
3. Terretta
  611h
  Anyone (including ANTHROP\C) "recommending selective use of cheaper models" is spending costly human time (which costs more over time) on correcting the machine (which costs less over time). This is a bad trade.
  In cost per line of code, we have verified this is always an error unless your time is worth less than the machine (unlikely unless you consider your time to have no cost rather than considering it as your hourly rate).
  The worst thing for our productivity has been Claude Code or Claude Cowork taking a complex problem and turning around and writing bad instructions for dumb model agents then synthesizing the dumb answers into an orchestra of badness.
  The single best fix for results-per-total-cost is to ensure it reads and thinks about whole content, not snippets, and thinks with the smartest model, not agents.
  Agents should toil. Agents should neither think*, nor decide what to think about which itself is thinking.
  * Agents should “think” like ants or bees or beavers think. Any human-like thinking, *especially* intuition-like thinking, should be thought by the best model available.
  ** Nobody should be “churning out code”. In a hierarchy of coders who translate detailed specs to some computer language, developers who write software that ships on a project timeline, and engineers who accomplish business goals, engineers should “churn out” engines structured for business outcomes.
  Measured by that, the machine is leverage while reducing a variety of costs. At the same time, because most training data doesn't grok this, the machine doesn't grok it either. So it needs you to shape its toil.
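The trade described above (human correction time versus token spend) can be sketched as simple arithmetic. This is a minimal illustration under stated assumptions: total cost is modeled as token spend plus human hours times an hourly rate, and all function names and figures are hypothetical rather than anything measured in this thread.

```python
def total_cost(token_cost, human_hours, hourly_rate):
    """Token spend plus the cost of human time spent steering and correcting."""
    return token_cost + human_hours * hourly_rate


def break_even_rate(cheap_tokens, cheap_hours, smart_tokens, smart_hours):
    """Hourly rate above which the pricier model is cheaper overall.

    Returns None when the pricier model saves no human time, in which case
    this sketch treats the break-even point as undefined.
    """
    hours_saved = cheap_hours - smart_hours
    if hours_saved <= 0:
        return None
    return (smart_tokens - cheap_tokens) / hours_saved


# Made-up figures: the cheap model costs 50 in tokens but needs 10 hours of
# correction; the smarter model costs 500 in tokens and needs 4 hours.
rate = break_even_rate(50, 10, 500, 4)
cheap_total = total_cost(50, 10, 100)
smart_total = total_cost(500, 4, 100)
```

With those made-up numbers the break-even rate is 75 per hour, so at an hourly rate of 100 the pricier model comes out ahead (900 versus 1050). The claim that the cheaper model is "always an error" depends entirely on how many human hours it actually adds.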
4. krzyk
  3613h
  If you have such a toxic environment, run.
  1. ninkendo
    2012h
    If you’re sitting under a tree in the rain and it gets soaked through and you start getting wet, finding another tree won’t help you.
    The whole industry is adjusting to the reality that the expected output of an engineer is much higher than it used to be. It’s not local to one company. You may find a better environment for the time being, but this is the direction everything is headed.
5. lumost
  211h
  This, I happily used the opus 4.6 fast mode to the tune of 5k for a project. The delivery of the project justified the 5k, if I only spent 500 but delivered the project 1 month later - I would have been in the dog house.
  1. apsurd
    19h
    Your project cost $5k in tokens? How does that work? over what time? My understanding is that most developers are given pro max plans at $200/m and are expected to max that out.
    I've been getting by on the $200/year plan by smoothing usage continuously over time.
    The pay per use is for the API so does it mean you're using the API in a custom setup?
6. giancarlostoro
  011h
  My real comment is, why were they not just using their self-hosted copies of it? Do they pay back Anthropic for use of it in Azure? Broker a deal, let Anthropic charge you drastically less to use their model AND Anthropic could have made Claude Code work directly with Azure for Microsoft employees. Pennies on the dollar, and Microsoft could do it using low use GPUs to save on cost, or stack underused GPU compute (this is how serverless was born btw - it's the unused resources in a web server somewhere).
  When you consider that xAI's old data center was enough to bring Anthropic back ahead, it tells me Microsoft could host their own on underutilized previous gen GPUs that are sitting there wasting server real estate.
7. locknitpicker
  18h
  > The comments I see recommending selective use of cheaper models don't match the reality I experience working in the industry. I have the constant threat hanging over my head of being fired if I don't churn out code quickly enough. I'm not willing to gamble with my livelihood by using a less effective model.
  I don't buy it. Old models such as GPT4.1 were faster than newer reasoning models, and their output was as good. Newer models end up wasting an ungodly amount of time with chain-of-thought steps which can be a complete waste of time if you have a structured prompt such as a plan or a spec.
  My experience in the real world is that users have to ration requests, and x0 models actually tend to be used far more because expensive models are left for more complex tasks.
8. bogota
  010h
  [dead]
9. cowsandmilk
  013h
  This, if you’re high performing, the company won’t question your use of tokens. If they want to limit it, they have ways to set limits on spend and usage.
iamflimflam1
9218h
From reading the article: they offered their developers both Claude Code and Copilot.
What they wanted was for them to use both and give feedback on which was better.
The developers voted with their feet and didn’t use Copilot.
What Microsoft were hoping was that the opposite would happen...
1. ryanhecht
  06h
  > The developers voted with their feet and didn’t use Copilot.
  This was true in January -- since then, the Copilot CLI team has spent countless hours with engineering leaders and the biggest Claude Code users at the company to understand Copilot's shortcomings, define evals to properly test them head-to-head, and close the gap between the products.
  The result? Claude Code usage was organically decreasing and Copilot CLI usage was organically increasing -- when this announcement was made, internal Copilot CLI usage had been greater than Claude Code usage for weeks!
tra3
951d
There's definitely a way to use Claude code that is token conscious.
I've tried throwing unsupervised agentic software factory workflows against the wall, and they burned through my tokens like nobody's business but didn't produce much.
Supervised, human-in-the-loop process on the other hand is much more productive but doesn't consume nearly as much. Maybe that's why everyone's pushing agentic approaches so much.
1. matheusmoreira
  1517h
  Yeah. Claude does good work but reviewing it all properly takes quite a bit of time. It got to the point I started having trouble maxing out my weekly allocation.
  Dealt with that by going all out and making an agentic parallel code review skill. Basically an infinite TODO list generator. Now I'm definitely getting 100% of the usage I paid for. It really burns tokens like nobody's business, and catches a lot of issues while at it. I've been looping this review/fix process every week. It's dramatically reduced the amount of stuff I need to pay attention to during my human review sessions.
relevant_stats
61d
So, snippet from the article says the following:
> I understand that Microsoft is planning to remove most of its Claude Code licenses and push many of its developers to use Copilot CLI instead. While Claude Code has been a popular addition, it has also undermined Microsoft’s new GitHub Copilot CLI coding tool — a command line version of GitHub Copilot that runs outside of development apps like Visual Studio Code.
And people here are interpreting this as related mainly to Claude burning too many tokens too quickly and suggesting Microsoft should rather use SomeOtherLLM©?
Is this Hacker News or rather Marketing Wars?
proxysna
411d
Feels about right.
I launched an internal demo of Claude Code and Deepseek on the same day and we burned through our monthly allowance for Claude in just over a week, with more than half of that budget being spent in one day. With DS people are unable to go through that same amount of money in a month, not even close.
With that Claude feels like an expensive toy, while DS is a shovel, purely because developers do not feel like they are eating into a precious resource while using it. Also it does not feel like there is much of a difference in capability between Claude and DS-pro. DS-pro and flash do feel like sonnet/opus and haiku, but flash is still very-very capable.
plaidfuji
212h
Our shop is forced to use Copilot on gov cloud, and it’s so useless I usually stick to manually coding. Its syntax is messy, it randomly combines lines together, flips order, or drops a couple tokens worth of output in the middle of a line, and for some reason it consistently drops the last line of every code block. I assume we’re getting a few versions back of GPT under the hood. But it does make me appreciate how the models of the past year or so crossed the threshold from interesting to truly productivity-enhancing.
Between Copilot, Claude, and Gemini, I still actually prefer Gemini. I do a lot of scientific writing in addition to coding and Gemini is the only model I can trust to “just be right”. This trust then transfers over to its code output.
zkmon
21d
My experience is that Claude Code burns way more tokens than other agents, probably to ensure high levels of perceived quality, which is, most of the time, not worth the bloat for the user. The bloat works for Anthropic as an advertisement at the cost of your tokens.
1. andrekandre
  11d
  it's kind of weird tho, jensen also said we should be burning tons of tokens as well... 'perceived quality' can't be the only reason these ceos are pushing token usage so hard, can it?
rnxrx
251d
This does kind of beg the question: If developers are being laid off because AI is better/faster/cheaper or makes all their people 10x or whatever fig leaf, what happens if the required tooling ends up being more expensive? From the investor’s point of view is the drag of employee costs better or worse than a ballooning expense item?
1. andrewl-hn
  21d
  They lay people off and look good in front of investors. Then they hire people, talk about "growth", and once again look good in front of investors.
  This would never fly if stock market was rational. But it never is.
robertkarl • OP
31d
Cancellation effective June 30. This was a _pilot_ launched in December that accidentally consumed their 2026 yearly target spend on AI!
I expect the r/LocalLLaMA guys to be going nuts about this news.
1. thewebguyd
  21d
  From the article
  > It was part of an effort to get project managers, designers, and other employees to experiment with coding for the first time.
  I suspect they weren't as efficient as they could be with token use either. Sounds like they were trying to encourage non-developers to vibe code stuff
cbdevidal
717h
I’ve been quite content with Copilot’s $10/mo plan. Still offers access to Claude models (limited tokens) but has no time limits like the $20 Claude plan, so no interruptions in workflow. I use one of the free models for the more pedestrian tasks then sic Claude on the particularly thorny problems. Works very well for me.
1. mellosouls
  416h
  I'm not sure if you are referring to the old or new plan?
  Github Copilot offered probably the best value and was IMO underappreciated for a long time; I've been an annual subscriber since day 1.
  The changes announced a few days ago completely revoke that value proposition, I doubt I'll continue with it.
keyle
718h
The title is somewhat bait. It reads like MSFT is using less AI, while in fact it's just a force swap to Copilot.
Arguably, Copilot is GPT 5? Not sure what the CLI offers behind the covers.
1. meowkit
  018h
  Copilot is the name for the harness / wrapper of MSFT products
  The CLI can swap to whatever model (/models) based on your subscriptions.
  The copilots on desktop or Office Apps are likely just GPT5 nano or other tiny models with cheap inference
andrewl-hn
11d
I'm surprised they even had them in the first place. Doesn't Microsoft have a deep partnership with OpenAI? Aren't all Copilot things powered by various GPT models? I would assume the two companies have barter agreements of sorts.
1. RevEng
  01d
  They do have agreements, but they aren't exclusive, and Microsoft and OpenAI have had a rather public falling out over the last year.
thisislife2
010h
More here: Microsoft reports are exposing AI's real cost problem: Using the tech is more expensive than paying human employees - https://fortune.com/2026/05/22/microsoft-ai-cost-problem-tok...
gradientsrneat
08h
Related: Microsoft-owned GitHub recently switched to token-based billing:
https://github.blog/news-insights/company-news/github-copilo...
Claude tokens are priced by GitHub at a disproportionately premium price compared to Gemini and OpenAI. I wonder why?
https://docs.github.com/en/copilot/reference/copilot-billing...
tyleo
11d
Lots of these places measure employee token use with managers having dashboards. It seems like performative code production rather than making anything useful.
Speed without judgement always compounds badly.
1. andrewl-hn
  01d
  Tokens are the current era's "lines of code per month"
  https://www.folklore.org/Negative_2000_Lines_Of_Code.html
skeledrew
01d
Well, that's the inevitable outcome of token-maxxing :shrugs:
maxignol
114h
This might actually be clever, since Microsoft devs will be longing for Claude Code features, which might result in Copilot getting way better
1. ryanhecht
  04h
  That's what we've spent the last five months doing! Have you tried the Copilot CLI recently? We've onboarded loads of feedback from Microsoft devs who were switching from Claude Code -- I'm proud of how far the team has come! This announcement comes at a time where Copilot CLI usage has been greater than Claude Code usage at Microsoft for several weeks; we've been winning hearts and minds!
visualphoenix
06h
Good luck to them! I recently had the misfortune of fighting Copilot on a Github PR and it made me want to never contribute to the project again.
loloquwowndueo
311h
Reminds me of when Steve Ballmer forbade his children to use iPods and pushed towards the Zune instead. Hahaha
1. bel8
  27h
  1) They can still use Anthropic models.
  2) Opus is not even unambiguously best at coding anymore. GPT 5.5 splits that title for some time now.
  3) I would have probably done the same in his position. Dogfood the product.
dsagent
61d
I think what's funny is that employees were most likely already covering the cost for these tools because they are useful. Companies didn't believe employees were using these tools and now have forced their usage and no longer have the costs subsidized.
Similarly companies seem to reward high token usage as a sign of someone willing to play ball with AI and again have forced higher costs on themselves for people reward hacking or using tokens out of spite.
1. QuiEgo
  51d
  There is no world where I can put my company’s data through an external site without their express consent and security sign off. I suspect at most companies there’s zero path for people to have been paying for it themselves.
sreekanth850
018h
If you properly keep documents, architecture, and decision records, token consumption can be much lower. I am managing everything with two Codex Plus subscriptions. Repo size is 300k LOC (backend).
usernametaken29
017h
I switched to OpenRouter and OpenCode a while ago. It is much cheaper, much much cheaper, and A LOT more reliable. Particularly Gemini was a piece of trash when it came to uptime
uniclaude
21d
That's very interesting to reconcile with the fact that not too far, Amazon employees feel incentivized to use as many tokens as possible.
1. HDThoreaun
  11d
  "incentivized to use as many tokens as possible" = "Upper management knows people don't like change so we are forcing them to come up with ways to use this thing". It does not mean that management will encourage wastefulness in the future, and it also doesn't mean that token usage from now won't be reviewed in the future. What's to stop them from dinging your performance in November because you wasted a hundred thousand on tokens with nothing to show for it?
zabil
017h
I switched from Claude Code to the GitHub Copilot app recently. Since our repositories are hosted on GitHub, I find the Copilot app better integrated for the PR workflow, with PR management available in the app. I don’t think I miss any of the features of Claude Code. I never thought I would make the switch, but Copilot upped the game.
Also it became very hard to convince management to keep both Claude code and GitHub Copilot enterprise licenses.
geoffbp
010h
How efficient is Claude at cleaning up unused code and making things more simple - as good as it is at adding code / features?
andyfilms1
101d
Surely a company as large as Microsoft is actively attempting to build their own models. They couldn't possibly have expected to stake the future of their software development on the conditions of a third party company?
1. mrweasel
  41d
  Okay, but what if you're not Microsoft's size and don't have an R&D budget large enough to fund development of your own models and tools?
  This is a warning to any company, not building their own AI, that AI assisted development could become really expensive really fast and most likely won't pay off. What Microsoft is suggesting is that the current price is too high, but it's still not high enough for e.g. Anthropic to be profitable, or AI coding tools are only as good as the developers using them. So you can't meaningfully do layoffs by replacing the developers with AIs, because the cost is too high.
  How does Microsoft plan to fix Copilot, so that the cost will be so much lower than Claude, that budget overruns won't be a problem for their own customers?
fredcallagan
07h
I have noticed, particularly in recent weeks and maybe the last couple of months, that token costs are just ridiculous. I can understand the upcoming IPOs and instinctive pressure to show profits ... but let's be honest, showcasing burning 1.3 million USD in tokens by a single developer in a month is the most ridiculous thing I have seen in my entire life. The general principles still apply. You expect to invest X and get a return on that investment. Unfortunately that's not so easy to promise or expect. There's no real 1 to 1 correlation between amount of code written and returns, and even less between tokens burned and returns. I'm starting to believe that the current token pricing approach, followed at the moment by all leading labs (especially considering OS models' capabilities), is borderline delusional ...
killerstorm
101d
The way coding agents work is fantastically wasteful. All the megabytes of code are processed over and over and over, sometimes within just one session.
There are papers describing KV cache precomputation for commonly used documents (e.g. KVLink), but, of course, it's not a priority for model providers: they'd rather sell you more tokens, also they would rather get to AGI/ASI first than optimize usage of existing models...
1. brookst
  81d
  Claude code gets >98% KV cache hits. It’s not reprocessing unless you let the cache go cold (5 minutes, which is annoyingly short).
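To make the cache argument concrete, here is a back-of-the-envelope sketch of how a high cache hit rate changes input-token cost, plus a check for whether a cache with a fixed time-to-live is still warm. The 98% hit rate and 5-minute window come from the comment above; the per-million-token prices are placeholders, not real pricing, and the model ignores any surcharge for writing to the cache.

```python
def effective_input_cost(tokens, hit_rate, full_price_per_m, cached_price_per_m):
    """Blend cached and uncached per-million-token prices by cache hit rate."""
    blended = hit_rate * cached_price_per_m + (1 - hit_rate) * full_price_per_m
    return tokens / 1_000_000 * blended


def cache_is_warm(seconds_since_last_request, ttl_seconds=300):
    """True if the previous request was recent enough to keep the cache alive."""
    return seconds_since_last_request <= ttl_seconds


# Placeholder prices: 3.0 per million uncached tokens, 0.3 per million cached.
warm_cost = effective_input_cost(1_000_000, 0.98, 3.0, 0.3)
cold_cost = effective_input_cost(1_000_000, 0.0, 3.0, 0.3)
```

Under these assumed prices, re-reading a million tokens with a warm cache costs roughly an eighth of the cold price, so the "processed over and over" cost depends heavily on whether requests arrive within the time-to-live window.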
goldylochness
012h
after having used claude for quite some time, i would buy puts on microsoft
wg0
31d
Microsoft should host DeepseekV4 internally for its developers. And you're welcome.
1. chris_money202
  015h
  Microsoft does self host claude and gpt for GHCP
wolvoleo
01d
What's the point of eating your own dog food when the only thing you are doing is reselling other people's dog food? Microsoft don't have any competing LLM.
guluarte
11d
I think tech companies are doing layoffs partly because they need to cover AI operating expenses.
1. stock_toaster
  01d
  I think so too, otherwise why wouldn't you put that (purported) increased capacity/output into improving your existing products or creating new ones, with the headcount that you already have?
gmerc
019h
They got DeepSeek on Azure, would cut costs by 10x … if they ran it on Huawei
matt3210
018h
Tokens aren’t that much of an issue when you're not evaluated on the usage
o10449366
21d
I switched from Anthropic to OpenAI after spending ~$40K in equivalent token costs using Claude over 3 months.
I found Opus 4.7 to be slow and wasteful with token usage. It's shocking how inefficient it is with tasks like bash tool usage and web searching, delegating them to a dozen subagents only to get stuck and never return until you esc and intervene. That, in addition to all of the broken tooling Anthropic built in to limit token usage like the broken monitoring tool made managing Claude a chore. I was happy to pay $200/month for Opus 4.5 when they had more capacity, but 4.7 felt like a huge step back and no longer worth the price and inconvenience.
I remember an OpenAI employee comment on the GPT5.5 release post about how they specifically geared it towards long-horizon tasks and it's been a breath of fresh air in that regard. I have five two-week long sessions going right now and there's been no degradation in performance or efficiency. It's much better at carrying rules/learnings forward even in long-running sessions and grounding/refreshing itself in verified facts when it loses context.
It's funny because in two weeks I've gotten way more done with GPT5.5 with way fewer tokens and way less handholding. I think this goes to show how important the tooling and the harness are and how a capable model like Opus 4.7 can be severely handicapped by bad product decisions.
Kapura
010h
"everybody needs to use these new AI tools or you will be left behind. no! not like that! the cheap, worser ones!"
sergiomattei
11d
My impression is they're being cancelled in favor of full internal adoption of Copilot CLI, which has got much better over the past few months.
1. Shalomboy
  01d
  I'm also a big fan of Copilot CLI, especially after demoing it to a coworker who liked Claude Code.
heisenbit
112h
How would one call such a strategy? Embrace and extend comes to mind.
1. lou1306
  012h
  This has really little to do with embrace and extend. They are not taking over an open standard or anything like that.
  If anything, it's forced dogfooding, i.e., forcing their own workforce to beta-test their product.
dminik
016h
To be fair, Microsoft dogfooding something for once would be great.
la64710
012h
It seems that people are using LLMs to generate code but many complain of subpar code. I recall the early days of virtualization, when folks would use it but complain about performance. HW capacity continued to improve until virtualization became the de facto standard. I wonder if subpar code will become better as more powerful agents, models, or compute become available.
jgalt212
014h
What percent of internal Microsoft IP runs through Anthropic? Do they not care about trade secrets, or are certain groups allowed or not allowed to use tools that expose IP to external vendors?
jadar
011h
It's been said that technologies are not products. CC might be better, but at the end of the day M$ is going to want to cut costs and have employees use their own technology. Perhaps Copilot CLI is close enough, and the CC product doesn't justify the cost of the Claude (technology) license when M$ has their own technology to leverage.
Side note, it's so frustrating that The Verge puts a paywall at the fold. It makes me feel like the rest of the story is not worth reading. I'm not inclined to pay $2 to read a link that was posted on an aggregator.
nobodywillobsrv
019h
This feels like one of those bad incentive problems we always hear about on here ... Like bugs and vipers.
DeathArrow
019h
Doesn't MS have the compute to run GPT 5.5 for all its employees?
ndiddy
121d
This is an AI generated summary of a blog post (https://www.thelowdownblog.com/2026/05/microsoft-cancels-int...) which is a summary of an AI generated article (https://blazetrends.com/microsoft-cancels-claude-code-pilot-...) which is a summary of another AI generated article (https://www.themodelwire.com/article/microsoft-starts-cancel...) which is a summary of an article from The Verge (https://www.theverge.com/tech/930447/microsoft-claude-code-d...). I guess it would be better to link the Verge article instead.
josefritzishere
01d
AI slop ruined a story about AI? This thread is a story about itself.
thadk
01d
Microsoft poorly manages token use of most expensive models in a pilot. Then they use that failure to advertise/position their own Github Copilot agents to procurement teams, over the now widely validated Claude Code-based agents.
At least Codex is trying to win validation on merit.