The discourse around AI and jobs tends toward two extremes: either AI will replace everyone, or it’s all hype. The reality, as always, is more nuanced — and now there is data to show exactly what it looks like.
This analysis visualises the work done by JoshKale (originally inspired by Andrej Karpathy), who used an LLM to score every single occupation in the Bureau of Labor Statistics Occupational Outlook Handbook — all 342 of them — on a 0–10 AI exposure scale. The result is the most comprehensive public dataset I have seen on this topic.
The live interactive treemap lets you explore it yourself. This post goes further: it breaks the data into charts, quantifies the employment impact, and draws out the patterns that matter.
How the Scoring Works
Each occupation was fed to an LLM (Google Gemini Flash) along with its full BLS description — pay, typical tasks, required education, and employment outlook. The model scored it 0–10 on AI exposure:
| Score | Tier | Examples |
|---|---|---|
| 0–1 | Minimal | Roofers, landscapers, janitors |
| 2–3 | Low | Electricians, plumbers, firefighters |
| 4–5 | Moderate | Registered nurses, police officers |
| 6–7 | High | Teachers, managers, journalists, accountants |
| 8–9 | Very High | Software developers, graphic designers, data analysts |
| 10 | Maximum | Data entry clerks, medical transcriptionists, telemarketers |
The average across all 342 occupations: 5.3 out of 10. That is a striking number — the median American worker sits firmly in the moderate-to-high exposure zone.
Distribution of AI Exposure Scores
Where do the 342 occupations actually land?
The distribution is roughly bell-shaped but skewed right. More occupations cluster in the 5–8 range than in the 1–3 range. There are very few truly “safe” jobs — physical-labour roles make up the left tail, while pure knowledge-work and clerical roles pile up on the right.
AI Exposure by Occupation Category
The BLS groups all occupations into 22 major categories. Averaging the exposure scores within each category reveals a stark divide between knowledge work and physical work:
The pattern is clear: any work primarily done on a screen is highly exposed. Office and administrative support — the largest single occupational category by employment — tops the list. Computer occupations, legal work, and financial services are all above 7.5.
At the other end: farming, construction, building maintenance, and food preparation are below 3. These jobs require physical presence, manual dexterity, and spatial reasoning that current AI cannot meaningfully replace.
The Most Exposed Occupations
The occupations at 9–10 are those where the core task is already digital and highly routine:
Several of these are already in steep decline according to BLS projections. Court reporters, bookkeeping clerks, and bill collectors are all seeing employment fall as software catches up. The LLM score is not predicting the future — it is describing what is already happening.
The Least Exposed Occupations
The jobs most resistant to AI share a common thread: they happen in the physical world, often in unpredictable environments.
There is no machine that can reroof a house in the rain, troubleshoot a burst pipe inside a finished wall, or respond to a structure fire. These are not “low-skill” jobs — they require years of training and command solid wages. They are simply not digitizable.
How Many Workers Are Actually at Risk?
Exposure scores are interesting, but employment counts are what determine real-world impact. When you multiply exposure by the number of workers in each tier, the picture gets sobering:
~57 million American workers are in occupations scoring 7 or above. That is roughly a third of the entire US workforce. This does not mean 57 million jobs disappear tomorrow — but it means 57 million jobs will be significantly reshaped by AI tools, workflows, and automation over the coming decade.
The 30 million workers in the “minimal” tier (scores 1–3) are in the most defensible position. Ironically, many of these are among the lower-paid occupations.
Pay vs. AI Exposure
One of the more uncomfortable patterns in this data: high pay correlates with high exposure.
The top-right quadrant — high pay, high exposure — contains software developers, lawyers, financial analysts, and physicians. These are well-compensated knowledge workers whose core output (analysis, writing, code, decisions) is exactly what LLMs are optimised to produce.
The bottom-left — low pay, low exposure — contains food service workers, farm workers, and cleaners.
This creates a perverse situation: the workers who are most financially equipped to adapt (high earners) are the most exposed, while the workers least able to adapt are the most protected, for now, simply because their physical work is harder to automate.
Growth Outlook vs. Exposure
BLS employment projections through 2034 paint a consistent picture:
Occupations with high AI exposure are concentrated in the “declining” and “little or no change” buckets. Occupations with low exposure skew toward “faster than average” or “much faster than average” growth — driven by construction, healthcare, and personal services where physical presence is irreplaceable.
What This Means
A few things stand out from this dataset:
The clerical class is in the most immediate danger. Bookkeeping, data entry, billing, transcription — these are not abstract future risks. BLS is already projecting declines. The tools exist today.
Software developers are more exposed than most tech workers admit. A score of 9 does not mean replacement, but it does mean augmentation so aggressive that productivity per developer will multiply — meaning fewer developers are needed for the same output. GitHub Copilot is the early version of this. The trajectory is clear.
Healthcare is the interesting middle case. Physicians, nurses, and technicians sit at 5–6: genuinely moderately exposed. AI will handle diagnosis assistance, documentation, and research — but the physical and social dimensions of care cannot be automated away.
Physical labour is the hedge nobody talks about. An electrician or plumber with a strong customer base is more AI-proof than a senior analyst at a large financial institution. The market has not fully priced this in yet.
About This Dataset
The methodology was developed by Andrej Karpathy and subsequently implemented and published by JoshKale. The pipeline:
- Scraped all 342 occupation pages from the BLS Occupational Outlook Handbook
- Parsed each into clean Markdown (pay, tasks, education requirements, employment outlook)
- Scored each with an LLM using a detailed rubric
- Merged with BLS employment statistics
- Built an interactive treemap visualisation
The full dataset, source code, and methodology are open source at github.com/JoshKale/jobs.
Comments