Gaétan de Rassenfosse and Samuel Arnod-Prin’s IPRoduct platform, a crowdsourced project to link products to patents, is now live. There are currently four ways to contribute. You can help train machine-learning algorithms by classifying web documents, enrich data on companies, submit relevant web documents, or share pictures of patented products. Each contribution is rewarded with credits that will allow you to download the data when that feature is supported.
Register here to start exploring the platform!
Sam Arts, Jianan Hou, Juan Carlos Gomez have a paper “Natural Language Processing to Identify the Creation and Impact of New Technologies in Patent Text: Code, Data, and New Measures” forthcoming as a research note in Research Policy. The paper is available as a pre-release here, Python code on GitHub, and data on Zenodo.
Cyril Verluise and Gaétan de Rassenfosse’s patCit project, a Comprehensive Patent Citations Dataset, is available on BigQuery and Zenodo, with open-source code on GitHub. v0.3 includes major improvements to citations tables, and the addition of category-specific tables for a range of citations.