‘Moving On’—investigating inventors’ ethnic origins using supervised learning
نویسندگان
چکیده
Abstract Patent data provides rich information about technical inventions, but does not disclose the ethnic origin of inventors. In this article, I use supervised learning techniques to infer information. To do so, construct a dataset 96′777 labeled names and train an artificial recurrent neural network with long short-term memory (LSTM) predict origins based on names. The trained achieves overall performance 91.4% across 18 origins. model investigate 2.68 million inventors provide novel descriptive evidence regarding their composition over time countries technological fields. global has become more diverse last decades, which was mostly due relative increase Asian Furthermore, prevalence foreign-origin is especially high in USA, also increased other high-income economies. This mainly driven by inflow non-Western into emerging high-technology fields for countries.
منابع مشابه
The Social Origins of Inventors∗
In this paper we merge three datasets individual income data, patenting data, and IQ data to analyze the deterninants of an individual’s probability of inventing. We find that: (i) parental income matters even after controlling for other background variables and for IQ, yet the estimated impact of parental income is greatly diminished once parental education and the individual’s IQ are controll...
متن کاملThe Agglomeration of US Ethnic Inventors
The ethnic composition of US inventors is undergoing a signi cant transformation with deep impacts for the overall agglomeration of US innovation. This study applies an ethnic-name database to individual US patent records to explore these trends with greater detail. The contributions of Chinese and Indian scientists and engineers to US technology formation increase dramatically in the 1990s. ...
متن کاملThe Social Origins and IQ of Inventors∗
In this paper we merge three datasets individual income data, patenting data, and IQ data to analyze the determinants of an individual’s probability of inventing. We find that: (i) parental income is positively associated with the probability of inventing, yet the estimated impact of parental income is greatly diminished once parental socioeconomic status, parental education, and the individual...
متن کاملIncidental Supervision: Moving beyond Supervised Learning
Machine Learning and Inference methods have become ubiquitous in our attempt to induce more abstract representations of natural language text, visual scenes, and other messy, naturally occurring data, and support decisions that depend on it. However, learning models for these tasks is difficult partly because generating the necessary supervision signals for it is costly and does not scale. This...
متن کاملQuery Segmentation Using Supervised Learning
Improving both recall and precision to return the specific results that the user originally intended to find is an important aspect of search engine improvements. One component of this problem is related to translating the users’ intention to the best query sent to a search engine in order to get the most relevant results. For example, if a user is looking for the cheapest 8 GB flash drive, she...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Economic Geography
سال: 2023
ISSN: ['1468-2710', '1468-2702']
DOI: https://doi.org/10.1093/jeg/lbad001