Courtesy: Perplexity generated
OpenAI's GDPval evaluation framework covers 44 occupations selected from the top 9 industries that contribute most to the U.S. GDP. These occupations are predominantly knowledge work roles and were chosen based on US Bureau of Labor Statistics and O*NET data. The 44 occupations include a broad range of jobs where AI could have the highest impact on real-world productivity. Here is the list of the 44 occupations included in the GDPval evaluation:
1. Software Engineers / Developers
2. Lawyers
3. Registered Nurses
4. Financial Advisors
5. Social Workers
6. Industrial Engineers
7. Real Estate Sales Agents
8. Customer Service Representatives
9. Pharmacists
10. Private Detectives
11. Video Editors
12. Marketing Managers
13. Accountants
14. Civil Engineers
15. Graphic Designers
16. Human Resources Specialists
17. Mechanical Engineers
18. Sales Managers
19. Data Analysts
20. Management Consultants
21. IT Support Specialists
22. Financial Analysts
23. Web Developers
24. Administrative Assistants
25. Paralegals
26. Medical Technologists
27. Network Administrators
28. Content Writers
29. Business Analysts
30. Insurance Underwriters
31. Architects
32. Editors
33. Database Administrators
34. Translators
35. Event Planners
36. Journalists
37. Health Educators
38. Editors
39. Quality Control Analysts
40. Purchasing Agents
41. Loan Officers
42. Market Research Analysts
43. Training and Development Specialists
44. Construction Managers
The 44 occupations spanning 9 industries that contribute over 5% to U.S. GDP, are grouped by industries:
Real Estate and Rental and Leasing
- Concierges
- Property, real estate, and community association managers
- Real estate sales agents
- Real estate brokers
- Counter and rental clerks
Government
- Recreation workers
- Compliance officers
- First-line supervisors of police and detectives
- Administrative services managers
- Child, family, and school social workers
Manufacturing
- Mechanical engineers
- Industrial engineers
- Buyers and purchasing agents
- Shipping, receiving, and inventory clerks
- First-line supervisors of production and operating workers
Professional, Scientific, and Technical Services
- Software developers
- Lawyers
- Accountants and auditors
- Computer and information systems managers
- Project management specialists
Health Care and Social Assistance
- Registered nurses
- Nurse practitioners
- Medical and health services managers
- First-line supervisors of office and administrative support workers
- Medical secretaries and administrative assistants
Finance and Insurance
- Customer service representatives
- Financial and investment analysts
- Financial managers
- Personal financial advisors
- Securities, commodities, and financial services sales agents
Retail Trade
- Pharmacists
- First-line supervisors of retail sales workers
- General and operations managers
- Private detectives and investigators
Wholesale Trade
- Sales managers
- Order clerks
- First-line supervisors of non-retail sales workers
- Sales representatives, wholesale and manufacturing, except technical and scientific products
- Sales representatives, wholesale and manufacturing, technical and scientific products
Information
- Audio and video technicians
- Producers and directors
- News analysts, reporters, and journalists
- Film and video editors
- Editors
These occupations were selected based on their economic significance and predominance of knowledge work, with tasks crafted and reviewed by experienced professionals to reflect authentic real-world deliverables. And the potential for AI to augment or replicate certain job functions. The GDPval evaluation tests AI on 1,320 representative work tasks from these occupations, designed by professionals with an average of 14 years’ experience, focusing on realistic deliverables like legal briefs, engineering blueprints, nursing care plans, customer support conversations, and more [1][3][4].
Citations:
[1] OpenAI: GDPval Framework Tests AI On Real-world Jobs https://dataconomy.com/2025/09/26/openai-gdpval-framework-tests-ai-on-real-world-jobs/
[2] OpenAI Releases List of Work Tasks ChatGPT Can Already ... https://futurism.com/future-society/openai-work-tasks-chatgpt-can-already-replace
[3] Measuring the performance of our models on real-world tasks https://openai.com/index/gdpval/
[4] Can AI do your job? OpenAI's new test reveals how it ... https://www.tomsguide.com/ai/chatgpt/openai-is-now-testing-chatgpt-against-humans-in-44-different-occupations-from-lawyers-and-software-developers-to-registered-nurses-heres-the-full-list-of-jobs-affected
[5] New benchmark for economically viable tasks across 44 ... https://www.reddit.com/r/singularity/comments/1nqef1l/new_benchmark_for_economically_viable_tasks/
[6] gdpval: evaluating ai model performance https://cdn.openai.com/pdf/d5eb7428-c4e9-4a33-bd86-86dd4bcf12ce/GDPval.pdf
[7] OpenAI https://x.com/OpenAI/status/1971249375356899445
[8] OpenAI Releases List of Work Tasks It Says ChatGPT Can ... https://au.news.yahoo.com/openai-releases-list-tasks-says-110000843.html
No comments:
Post a Comment