Skip to main content

This record contains commonly requested metadata for the NBER Working Paper series in various delimited UTF-8 or Stata .dta formats suitable for analysis by statistical or general purpose software. It can also load into spreadsheets with a single click. These files are intended for statistical analysis. Several formats for bibliographic software are linked from the NBER Information for Libraries page.

Available formats:

All of these files should update automatically each week.

For the character files the first column is always the Working Paper number, then a tab, then the listed variable(s). The first row has variable names. Subordinate tables, like working_papers_authors or working_papers_jels, have multiple rows per paper, often organized by an order_num field. So the row in working_papers_authors with order_num = 0 is the first author, order_num = 1 is the second author, etc.

These files have one observation per paper, or one observation for each paper-author combination, paper-program or paper-jel combination. File names and content for the tab delimited versions:

abs.tsvabstract
date.tsvissue date
jel.tsvJEL codes
prog.tsvNBER program names
proj.tsvNBER project names
auth.tsvauthor name
auths.tsvauthors, (paper, authors) (comma delimited for multiple authors)
title.tsvTitles
published.tsvBook or Journal Publication information

There is also ref.tsv which combines paper number, author, title and issue date in a single line like a journal reference. In this file the author's names for each paper are concatenated into a single string. Variable names by file:
 

abs.tsv:paper,abstract
author_user.tsv:paper,author_user
auths.tsv:paper,name
date.tsv:paper,issue_date
jel.tsv:paper,jel
prog.tsv:paper,program
proj.tsv:paper,project
project.tsv:paper,project
published.tsv:paper,published_text
title.tsv:paper,title
ref.tsv:paper,author,title,issue_date,doi

Clicking on the file name in the tsv subdirectory will open the file in a spreadsheet, for most users. Stata versions are in ./dta. Here is an example of a Stata program that prepares a bibliography of all papers in the Health Economics program:

Sample Stata code to list all papers in the HE program:

use "http://data.nber.org/nber_paper_chapter_metadata/dta/ref.dta"
keep paper author title
save ref,replace
clear
use "http://data.nber.org/nber_paper_chapter_metadata/dta/prog.dta"
keep if program=="EH"
merge 1:1 paper using ref 
list in 1/3

Working paper numbers have 4 or 5 digits and usually begin with a "w." Example "w12345". However, in years past the prefix letter could be an "h" (for Historical Paper series) or a "t" (for Technical Paper series). Both of these series are no longer distributed as new papers in favor of the single "w" coded Working Paper series. 

The published.tsv file shows where working papers were eventually published. It is not comprehensive. It combines information from authors, Crossref, RePEc and online vitae and journal names may vary by source.

Questions should be directed to:

Daniel Feenberg
feenberg@nber.org
+1 617-682-6204

Alternative format

A different extract with slightly different fields and parameters is available under

and as the gzipped files under

The files in those directories include dumps from our canonical database for tracking working papers and chapters, with some private information redacted.

The gzipped files (*.csv.gz) use alternative characters as the comma and quote characters, hex characters BF and AC.

The working_papers table is the main table, with one row per paper. Chapters in NBER books are also listed in that table, with ID numbers with 'c' in front.

Other subordinate tables, like working_papers_authors or working_papers_jels, have multiple rows per paper, often organized by an order_num field. So the row in working_papers_authors with order_num = 0 is the first author, order_num = 1 is the second author, etc.

 

More from NBER

In addition to working papers, the NBER disseminates affiliates’ latest findings through a range of free periodicals — the NBER Reporter, the NBER Digest, the Bulletin on Retirement and Disability, the Bulletin on Health, and the Bulletin on Entrepreneurship — as well as online conference reports, video lectures, and interviews.

2024, 16th Annual Feldstein Lecture, Cecilia E. Rouse," Lessons for Economists from the Pandemic" cover slide
  • Feldstein Lecture
Cecilia Rouse, president of the Brookings Institution and a professor at Princeton University, who chaired the Council...
 2024 Methods Lecture, Susan Athey, "Analysis and Design of Multi-Armed Bandit Experiments and Policy Learning"
  • Methods Lectures
Background Materials:backgroundAthey, Susan, Undral Byambadalai, Vitor Hadad, Sanath Kumar Krishnamurthy, Weiwen Leung...