APTO: Collaborative Research: Global Observatory and Virtual Laboratory for Science and Technology Advance
Over the past half-century, the global geopolitical balance of scientific, technological, and economic leadership has shifted, with China's meteoric rise and the ascendance of new powers including Korea and India. Technological leadership requires driving advances and setting standards that catalyze the future of global productivity. To understand pathways that enhance U.S. competitiveness in critical technology capacity, production, and use, this project will create a global observatory and virtual laboratory for U.S. science and technology in the context of global advancement. It will produce data sets and technology outcome models that capture the complex and emergent interdependencies among technologies; the funders, resources, researchers, and universities that catalyze and invent them; the workforces and organizations that produce them; and the markets that consume them. Drawing upon the power of deep neural network "transformer" architectures, the project will then build a deep-learned, chronologically trained, large language model (LLM) to function as a data-driven "digital double" of the global techno-scientific system.
The LLM will embed research artifacts (e.g., articles, patents, products, related news, and their rich meta-data) in a high-dimensional space, mapping them to quantitative metrics of technology capability, production, and use. The project team will fine-tune our LLMs to capture changes in key metrics as corresponding trajectories within embedding space, and thus enable them to function as 1) a global observatory for technology catalysis, capacity, production, and use; and 2) a virtual laboratory for simulated experiments that can guide 3) causal estimation of relationships among policy levers (funding, competition, immigration), technology performance, and global leadership. They will also tune the LLMs and related models to enable customized extraction, structuring, and disambiguation of data on research, products, funding, and policy from novel sources to enrich modeled observations and predictions, which will enable the continuous incorporation of additional data and extraction of insight. Finally, they will use the models as resources for scientists and policymakers by building dashboards to provide funding agencies, policymakers, and researchers with the situational awareness required to improve the quality and diversification of their technology development portfolios.
Investigator
Supported by the National Science Foundation through a subcontract with the University of Chicago grant #2404112
Related
More from NBER
In addition to working papers, the NBER disseminates affiliates’ latest findings through a range of free periodicals — the NBER Reporter, the NBER Digest, the Bulletin on Retirement and Disability, the Bulletin on Health, and the Bulletin on Entrepreneurship — as well as online conference reports, video lectures, and interviews.

- Feldstein Lecture
- Presenter: Cecilia E. Rouse

- Methods Lectures
- Presenter: Susan Athey