Wednesday, October 1, 2025

OpenAI claim to be better than human in 44 jobs

Courtesy: Perplexity generated 

OpenAI's GDPval evaluation framework covers 44 occupations selected from the top 9 industries that contribute most to the U.S. GDP. These occupations are predominantly knowledge work roles and were chosen based on US Bureau of Labor Statistics and O*NET data. The 44 occupations include a broad range of jobs where AI could have the highest impact on real-world productivity. Here is the list of the 44 occupations included in the GDPval evaluation:

1. Software Engineers / Developers  
2. Lawyers  
3. Registered Nurses  
4. Financial Advisors  
5. Social Workers  
6. Industrial Engineers  
7. Real Estate Sales Agents  
8. Customer Service Representatives  
9. Pharmacists  
10. Private Detectives  
11. Video Editors  
12. Marketing Managers  
13. Accountants  
14. Civil Engineers  
15. Graphic Designers  
16. Human Resources Specialists  
17. Mechanical Engineers  
18. Sales Managers  
19. Data Analysts  
20. Management Consultants  
21. IT Support Specialists  
22. Financial Analysts  
23. Web Developers  
24. Administrative Assistants  
25. Paralegals  
26. Medical Technologists  
27. Network Administrators  
28. Content Writers  
29. Business Analysts  
30. Insurance Underwriters  
31. Architects  
32. Editors  
33. Database Administrators  
34. Translators  
35. Event Planners  
36. Journalists  
37. Health Educators  
38. Editors  
39. Quality Control Analysts  
40. Purchasing Agents  
41. Loan Officers  
42. Market Research Analysts  
43. Training and Development Specialists  
44. Construction Managers  


The 44 occupations spanning 9 industries that contribute over 5% to U.S. GDP, are grouped by industries:

Real Estate and Rental and Leasing
- Concierges
- Property, real estate, and community association managers
- Real estate sales agents
- Real estate brokers
- Counter and rental clerks

Government
- Recreation workers
- Compliance officers
- First-line supervisors of police and detectives
- Administrative services managers
- Child, family, and school social workers

Manufacturing
- Mechanical engineers
- Industrial engineers
- Buyers and purchasing agents
- Shipping, receiving, and inventory clerks
- First-line supervisors of production and operating workers

Professional, Scientific, and Technical Services
- Software developers
- Lawyers
- Accountants and auditors
- Computer and information systems managers
- Project management specialists

Health Care and Social Assistance
- Registered nurses
- Nurse practitioners
- Medical and health services managers
- First-line supervisors of office and administrative support workers
- Medical secretaries and administrative assistants

Finance and Insurance
- Customer service representatives
- Financial and investment analysts
- Financial managers
- Personal financial advisors
- Securities, commodities, and financial services sales agents

Retail Trade
- Pharmacists
- First-line supervisors of retail sales workers
- General and operations managers
- Private detectives and investigators

Wholesale Trade
- Sales managers
- Order clerks
- First-line supervisors of non-retail sales workers
- Sales representatives, wholesale and manufacturing, except technical and scientific products
- Sales representatives, wholesale and manufacturing, technical and scientific products

Information
- Audio and video technicians
- Producers and directors
- News analysts, reporters, and journalists
- Film and video editors
- Editors

These occupations were selected based on their economic significance and predominance of knowledge work, with tasks crafted and reviewed by experienced professionals to reflect authentic real-world deliverables. And the potential for AI to augment or replicate certain job functions. The GDPval evaluation tests AI on 1,320 representative work tasks from these occupations, designed by professionals with an average of 14 years’ experience, focusing on realistic deliverables like legal briefs, engineering blueprints, nursing care plans, customer support conversations, and more [1][3][4].

Citations:
[1] OpenAI: GDPval Framework Tests AI On Real-world Jobs https://dataconomy.com/2025/09/26/openai-gdpval-framework-tests-ai-on-real-world-jobs/
[2] OpenAI Releases List of Work Tasks ChatGPT Can Already ... https://futurism.com/future-society/openai-work-tasks-chatgpt-can-already-replace
[3] Measuring the performance of our models on real-world tasks https://openai.com/index/gdpval/
[4] Can AI do your job? OpenAI's new test reveals how it ... https://www.tomsguide.com/ai/chatgpt/openai-is-now-testing-chatgpt-against-humans-in-44-different-occupations-from-lawyers-and-software-developers-to-registered-nurses-heres-the-full-list-of-jobs-affected
[5] New benchmark for economically viable tasks across 44 ... https://www.reddit.com/r/singularity/comments/1nqef1l/new_benchmark_for_economically_viable_tasks/
[6] gdpval: evaluating ai model performance https://cdn.openai.com/pdf/d5eb7428-c4e9-4a33-bd86-86dd4bcf12ce/GDPval.pdf
[7] OpenAI https://x.com/OpenAI/status/1971249375356899445
[8] OpenAI Releases List of Work Tasks It Says ChatGPT Can ... https://au.news.yahoo.com/openai-releases-list-tasks-says-110000843.html