NBER Working Papers and Chapters Metadata

Source: National Bureau of Economic Research

This repository provides metadata for the National Bureau of Economic Research (NBER) Working Paper series: titles, authors, issue dates, abstracts, subject classifications (JEL codes), NBER programs and projects, and eventual journal or book publication. The data are distributed as tab-delimited text files, which open in most spreadsheets, and as Stata files.

Available formats:

All of these files should update automatically each week.

In the character (tab-delimited text) files, the first column is always the working paper number, followed by a tab and then the listed variable(s). The first row holds the variable names. Subordinate tables, such as working_papers_authors or working_papers_jels, have multiple rows per paper, often ordered by an order_num field that counts from zero: in working_papers_authors, the row with order_num = 0 is the first author, order_num = 1 is the second author, and so on.
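The layout described above (a header row, then the paper number, a tab, and the variable) can be read with Python's standard csv module. The following is a minimal sketch, not an official loader: the sample rows are invented for illustration, and treating quote characters as ordinary text is an assumption, since the page does not say how the files quote fields.

```python
import csv
import io

# Hypothetical sample in the layout described above: a header row, then the
# paper number, a tab, and the variable. Real files live in the tsv directory.
SAMPLE_TITLE_TSV = (
    "paper\ttitle\n"
    "w0001\tAn Example Title\n"
    "w0002\tAnother Example, with a Comma\n"
)


def read_tsv(handle):
    """Return a list of dicts keyed by the variable names in the first row."""
    # QUOTE_NONE treats quote characters as ordinary text. This is an
    # assumption about the files; the documentation does not specify quoting.
    reader = csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    return list(reader)


rows = read_tsv(io.StringIO(SAMPLE_TITLE_TSV))
titles_by_paper = {row["paper"]: row["title"] for row in rows}
print(titles_by_paper["w0002"])
```

For a downloaded file, passing `open("title.tsv", newline="", encoding="utf-8")` to `read_tsv` should work the same way; the UTF-8 encoding is likewise an assumption.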

Each file has one observation per paper, or one observation per paper-author, paper-program, or paper-JEL combination. File names and contents of the tab-delimited versions:

abs.tsv	abstract
date.tsv	issue date
jel.tsv	JEL codes
prog.tsv	NBER program names
proj.tsv	NBER project names
auth.tsv	author name
auths.tsv	all authors of a paper (paper, authors), comma delimited when there are several
title.tsv	title
published.tsv	book or journal publication information

There is also ref.tsv, which combines paper number, author, title, and issue date in a single line, like a journal reference. In this file the authors' names for each paper are concatenated into a single string. Variable names by file:

abs.tsv:	paper,abstract
author_user.tsv:	paper,author_user
auths.tsv:	paper,name
date.tsv:	paper,issue_date
jel.tsv:	paper,jel
prog.tsv:	paper,program
proj.tsv:	paper,project
project.tsv:	paper,project
published.tsv:	paper,published_text
title.tsv:	paper,title
ref.tsv:	paper,author,title,issue_date,doi

For most users, clicking on a file name in the tsv subdirectory will open the file in a spreadsheet. Stata versions are in ./dta. The Stata program below prepares a bibliography of all papers in the Health Economics program.

Sample Stata code to list all papers in the HE program:

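* Save the reference fields (one row per paper) for merging
use "http://data.nber.org/nber_paper_chapter_metadata/dta/ref.dta", clear
keep paper author title
save ref, replace
* Load program assignments and keep the Health Economics (HE) papers
use "http://data.nber.org/nber_paper_chapter_metadata/dta/prog.dta", clear
keep if program=="HE"
* Keep matched papers only, so papers outside HE are dropped
merge 1:1 paper using ref, keep(match) nogenerate
* Show the first three rows; use -list- alone to show all
list in 1/3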
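The same join can be sketched in Python against the tab-delimited versions. This is an illustrative sketch under stated assumptions: the rows are invented, the variable names follow the "Variable names by file" list, and the program code for Health Economics is assumed to be "HE". The helper names `read_tsv` and `papers_in_program` are hypothetical.

```python
import csv
import io

# Hypothetical rows; variable names follow the "Variable names by file" list.
PROG_TSV = "paper\tprogram\nw0001\tHE\nw0001\tLS\nw0002\tPE\nw0003\tHE\n"
REF_TSV = (
    "paper\tauthor\ttitle\tissue_date\tdoi\n"
    "w0001\tA. Author\tFirst Title\t2000-01-01\t\n"
    "w0002\tB. Author\tSecond Title\t2000-02-01\t\n"
    "w0003\tC. Author and D. Author\tThird Title\t2000-03-01\t\n"
)


def read_tsv(text):
    """Parse tab-delimited text whose first row holds the variable names."""
    return list(csv.DictReader(io.StringIO(text), delimiter="\t",
                               quoting=csv.QUOTE_NONE))


def papers_in_program(prog_rows, ref_rows, program):
    """Inner join: reference rows for the papers tagged with `program`."""
    wanted = {row["paper"] for row in prog_rows if row["program"] == program}
    return [row for row in ref_rows if row["paper"] in wanted]


he_refs = papers_in_program(read_tsv(PROG_TSV), read_tsv(REF_TSV), "HE")
for row in he_refs:
    print(row["paper"], row["author"], row["title"], sep=" | ")
```

Collecting the matching paper numbers into a set first means a paper assigned to several programs still appears once in the result.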

Working paper numbers have 4 or 5 digits and usually begin with a "w", for example "w12345". In years past, however, the prefix letter could be "h" (Historical Paper series) or "t" (Technical Paper series). Neither of these series issues new papers any longer; new papers appear only in the single "w"-coded Working Paper series.
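The numbering rule above can be checked with a short regular expression. The sketch below assumes IDs are a single prefix letter followed directly by the digits; the function name `parse_paper_id` is hypothetical, and IDs without a prefix are simply reported as non-matching.

```python
import re

# Prefix letters named in the text: w (Working), h (Historical), t (Technical).
SERIES = {"w": "Working Paper", "h": "Historical Paper", "t": "Technical Paper"}
PAPER_ID = re.compile(r"([wht])(\d{4,5})")


def parse_paper_id(paper_id):
    """Split an ID such as 'w12345' into (series name, number).

    Returns None when the ID does not follow the prefix-plus-digits pattern.
    """
    match = PAPER_ID.fullmatch(paper_id.strip().lower())
    if match is None:
        return None
    prefix, digits = match.groups()
    return SERIES[prefix], int(digits)


print(parse_paper_id("w12345"))
print(parse_paper_id("h0042"))
print(parse_paper_id("x123"))
```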

The published.tsv file shows where working papers were eventually published. It is not comprehensive. It combines information from authors, Crossref, RePEc, and online vitae, so journal names may vary by source.

Questions should be directed to:

Daniel Feenberg
feenberg@nber.org
+1 617-682-6204

Alternative format

A different extract with slightly different fields and parameters is available under

and as the gzipped files under

The files in those directories include dumps from our canonical database for tracking working papers and chapters, with some private information redacted.

The gzipped files (*.csv.gz) use alternative characters in place of the comma and quote characters: hex BF and AC, respectively.
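A reader for these dumps might look like the sketch below. Several points are assumptions rather than documented facts: that BF replaces the comma and AC replaces the quote (in that order), that both appear as single bytes, and that decoding as Latin-1 is acceptable. The sample data are invented and built in memory; `read_dump` is a hypothetical helper.

```python
import csv
import gzip
import io

FIELD_SEP = "\xbf"   # byte BF, assumed to stand in for the comma
QUOTE_CHAR = "\xac"  # byte AC, assumed to stand in for the quote


def read_dump(source):
    """Read a gzipped dump (path or binary file object) into a list of rows."""
    # latin-1 maps every byte to exactly one character, so BF and AC survive
    # decoding and can be given to csv as one-character delimiters.
    with gzip.open(source, mode="rt", encoding="latin-1", newline="") as text:
        return list(csv.reader(text, delimiter=FIELD_SEP, quotechar=QUOTE_CHAR))


# Hypothetical two-row dump, compressed in memory for the demonstration.
raw = (b"\xacpaper\xac\xbf\xactitle\xac\n"
       b"\xacw0001\xac\xbf\xacA title, with a comma\xac\n")
rows = read_dump(io.BytesIO(gzip.compress(raw)))
print(rows)
```

If the text fields turn out to be UTF-8, non-ASCII characters decoded this way would need to be re-encoded as Latin-1 and decoded as UTF-8 afterwards.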

The working_papers table is the main table, with one row per paper. Chapters in NBER books are also listed in that table, with ID numbers prefixed with 'c'.

Other subordinate tables, like working_papers_authors or working_papers_jels, have multiple rows per paper, often organized by an order_num field. So the row in working_papers_authors with order_num = 0 is the first author, order_num = 1 is the second author, etc.
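To illustrate how order_num recovers author order, the sketch below groups rows by paper and sorts them numerically. Only order_num is taken from the description above; the other field names ("paper", "name") and the sample rows are assumptions made for the example.

```python
from collections import defaultdict

# Hypothetical rows; only order_num comes from the text, the other field
# names are assumed for illustration.
author_rows = [
    {"paper": "w0002", "order_num": "0", "name": "C. Author"},
    {"paper": "w0001", "order_num": "1", "name": "B. Author"},
    {"paper": "w0001", "order_num": "0", "name": "A. Author"},
]


def authors_by_paper(rows):
    """Map each paper to its author names in order_num order (0 = first)."""
    grouped = defaultdict(list)
    for row in rows:
        # Convert to int so that "10" sorts after "9" rather than after "1".
        grouped[row["paper"]].append((int(row["order_num"]), row["name"]))
    return {paper: [name for _, name in sorted(pairs)]
            for paper, pairs in grouped.items()}


ordered = authors_by_paper(author_rows)
print(ordered["w0001"])
```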
