Translation Readiness Index: Measuring Patent-Paper Proximity from Scientific Publication Text
Abstract
Universities, funders, investors, and policy agencies often need to identify research with translational relevance before patents, licenses, startups, or industry collaborations are visible. This study introduces the Translation Readiness Index (TRI), a text-based measure evaluating a publication's semantic similarity to papers that appear in high-confidence patent-paper pairs. Using 20,610 publications from OpenAlex, including 9,431 publications from the Reliance on Science patent-paper pairs d...
Description / Details
Universities, funders, investors, and policy agencies often need to identify research with translational relevance before patents, licenses, startups, or industry collaborations are visible. This study introduces the Translation Readiness Index (TRI), a text-based measure evaluating a publication's semantic similarity to papers that appear in high-confidence patent-paper pairs. Using 20,610 publications from OpenAlex, including 9,431 publications from the Reliance on Science patent-paper pairs data and 11,179 matched comparison publications, we created paper-level 768-dimensional semantic embeddings from titles and abstracts with SPECTER2. After evaluating four machine learning classifiers, XGBoost achieved the highest ROC-AUC (0.77). We define TRI as the model-estimated probability that a publication belongs to the patent-paper-paired class. Linguistic analysis revealed that patent-paired publications more often use an invention-oriented framing, distinct from the observational language of the comparison group. External validation across University of Western Australia (UWA) publications and leading global universities demonstrated positive associations between high TRI scores and independent translational indicators. TRI provides a text-based method for identifying translation-ready research, though it should be interpreted as a measure of semantic proximity to patented science rather than a direct measure of realized commercialization.
Source: arXiv:2606.31102v1 - http://arxiv.org/abs/2606.31102v1 PDF: https://arxiv.org/pdf/2606.31102v1 Original Link: http://arxiv.org/abs/2606.31102v1
Please sign in to join the discussion.
No comments yet. Be the first to share your thoughts!
Jul 1, 2026
Environmental Science
Economics
0