According to the World Intellectual Property Office, a patent is an exclusive right granted for an invention []. From the moment a patent is filed, a full disclosure of the invention to a patent office would have occurred []. This information will be made available to the public in due course and often prior to the patent being granted []. In the health care technology context, patented innovations may include new pharmaceuticals, diagnostics, and medical devices. Often, inventors will apply for patents before clinical trials are initiated to protect the intellectual property of the innovative technology in the trial. However, in the case of low-risk medical devices, where clinical trials are not always conducted, patents may be one of the few indications of a new innovative product before it is introduced to the market. Therefore, patents may be considered as one of the first signs of innovation and are often searched in horizon scanning studies alone or in combination with other sources of weak signal detection [].
However, with the complexity of some types of health care technologies such as medical devices, as well as the advent of artificial intelligence and its integration into health care technologies, it is quite difficult to identify early signs of innovation in a timely manner. Some digital applications, for instance, evolve rather rapidly, while most class 1 (low risk) medical devices do not undergo clinical trials. Yet, these technologies serve as a primary source of weak signal detection for emerging health care technologies used by the Innovation Observatory (IO), the UK national horizon scanning and research intelligence body, funded by the National Institute for Health and Care Research [].
Furthermore, a thorough understanding of the health care technology development pathway from ideation to market is imperative to support early warning and alert systems which will, in turn, help inform strategic decision-making, the formulation of health policy, the allocation of research and development funds, and play a vital role in anticipating regulatory challenges associated with emerging technologies. Despite previous horizon scanning and forecast studies conducted in the context of the UK health care horizon scanning needs, it remains unclear how patent analysis methods are being used in forecasting technology emergence globally and how and whether current horizon scanning methods can be enhanced by the regular use of patent analysis studies [].
Anticipating health care technology innovation before market entry is a key challenge for horizon scanning activities. However, the role of patent landscape analysis within existing horizon scanning methods remains underexplored. For instance, could routine patent analysis help identify emerging MedTech opportunities earlier, thereby enabling more proactive innovation support? Might it also inform the development of regulatory frameworks to reduce delays in technology adoption? To assess the potential of integrating patent analysis into existing horizon scanning practices, we reviewed recent literature on patent studies related to health care technologies. The study had 3 main aims: to identify key intellectual property sources, such as patent databases; to examine methods used for patent retrieval and analysis; and to determine the primary purposes for using patent analysis in this context. The findings aim to support the standardization of weak signal detection and offer a methodological foundation for integrating patent landscape analysis [] into horizon scanning, both within the IO and among wider national and international bodies, particularly for early-stage technologies.
A rapid scoping review was undertaken to efficiently examine the published literature and answer the predetermined research questions on methods, tools, and purpose for patent landscape analysis []. We used a scoping review approach to explore the breadth of the subject area and to map the recent patent analysis methods research landscape, while also accelerating some of the traditionally more time-consuming aspects of a systematic review by embracing the recommendations for rapid reviews set out by the Cochrane Rapid Review Methods working group []. The use of rapid methods applied to scoping reviews has been documented as an increasing trend in big umbrella reviews due to the efficiency gain [,]. We followed the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) extension for scoping reviews for the reporting of items relevant to this review. This review did not quality appraise the included studies as this was considered out of scope for the objectives of this project. A preagreed protocol outlining the methods and objectives of this rapid scoping review was registered in the Open Science Framework in June 2024 [].
Inclusion CriteriaTo outline the inclusion criteria for this rapid scoping review, we followed the Cochrane protocol guide. The review was structured based on the SDMO framework (study type, data, methods, outcomes) []. Studies eligible for inclusion needed to meet the following criteria: (1) any type of study or conference abstract in English language, (2) data related to the health care technology were reported, (3) methods and tools used for the analysis were mentioned, and (4) all reported outcomes were considered for inclusion although they were not deemed mandatory for the inclusion of studies in this review in line with the recommendation from Higgins et al [].
Search StrategyFollowing the recommendation for rapid searches, a targeted search strategy was devised and run in Embase (Ovid), which contains MEDLINE, by an experienced information specialist and checked against the PRESS checklist []. Our search strategy consisted of subject heading terms in combination with text words in title and abstract. Given the specificity of the topic and to maximize the sensitivity of our searches, we focused on the Method concept (Patent) from the SDMO framework outlined above. This concept was searched in combination with terms for the different types of studies such as “trend,” “analysis,” “landscape,” “mining” or more broadly “research.” This search was then translated and run in 2 more databases selected for their comprehensive coverage of technological and scientific topics: IEEE Xplore digital library and Web of Science. We searched in the title, abstract, and keyword fields. Time limits were imposed to retrieve records published since 2020 to capture the most contemporary tools and sources. Due to the lack of current guidelines for the application of time limits in searches for rapid reviews, this was a pragmatic decision informed by the recently updated Cochrane Rapid Review Methods guidance and agreed with the information specialist []. This approach ensured that our review remained focused on the latest developments and advancements in the field, providing the most relevant and up-to-date information. No language limit or any other limits were used. Records were downloaded into Endnote 20 (Clarivate Analytics) for deduplication. Full search strategies with results are provided in .
Data CollectionDeduplicated results were single screened at the title and abstract stage by 3 independent reviewers (SGGM, EP, and OE) separately using Rayyan (Qatar Computing Research Institute [QCRI]), a screening software for systematic reviews []. Consultation between reviewers was practiced at this stage on multiple occasions. In case of doubt due to ambiguity of titles or lack of clear reporting in abstracts, records were included for full-text screening. At full-text screening, the same 3 reviewers proceeded independently to single screen the full texts for inclusion. One-to-one consultation was exercised regularly throughout this process, and disagreements were resolved by checking with a fourth reviewer.
Data ExtractionTo ensure data extraction followed the recommendations of rapid review methods, an abbreviated data extraction form was developed to meet the agreed objectives of this review and was reported in the published protocol []. The original data extraction form was then piloted by the reviewers testing the relevance of data fields deemed for extraction. This step did not result in further alterations to the original data extraction form. Three independent reviewers extracted data, and one independently assessed its quality. We extracted bibliographic details; study-related characteristics such as the type of technologies, health condition, and objectives of the study; methods-related data including the sources searched, the geographical coverage of the sources, the time period searched, and the use, if any, of automated methods and tools.
Data SynthesisWe used narrative and tabulated methods for summarizing and synthesizing data collected during our data extraction stage. Whenever possible, charts have been used to visually present information relevant to the objectives of this study. We used the National Institute for Health and Care Excellence (NICE) disease classification system [] to map the 96 studies (75%) [-] that reported specific disease indications or population groups. A total of 178 separate topic references throughout the 96 papers were extracted and mapped against the NICE classification. Despite some second-level disease indications (eg, COVID-19) that can be mapped to multiple NICE first-level disease classifications, each mention of disease was only mapped once to avoid inflation of results.
Our searches identified a total of 1741 records. After deduplication, 1505 remained. We sought the full text of 261 records, of which only 124 were considered for inclusion in this review. presents a PRISMA flowchart of the retrieval and selection process for this review.
Figure 1. PRISMA flowchart for study selection via searching databases. Gray shading denotes rapid methods, for example, reduced number of databases searched, single screening, and single data extraction. PRISMA: Preferred Reporting Items for Systematic Reviews and Meta-Analyses. The selection criteria focused on relevance and the potential to contribute to the review’s objectives. This rigorous process ensured the reliability and comprehensiveness of the findings presented.
Study CharacteristicsFrom the 124 studies included, 67 (54%) [-,-] were original research papers, 54 (43.5%) [-,-] corresponded to reviews, 2 were conference papers [,], and 1 short presentation []. The majority of sources considered patent databases only (85, 68%) [-,,,-,,-,,,-,-,,,-,-,-,-,-,,,,,-,,,,-,,,,,,,-,,,,,] with the rest of the included studies using a combination of different sources. These 124 studies were published in 94 different journals. Nature Biotechnology was the most frequent journal in our dataset (n=9 studies) [,,,-,-]. A table outlining the characteristics of the included studies is presented in [-].
Research Objectives of Included StudiesThe objectives of each paper were extracted verbatim directly from the papers. In order to analyze these, and in line with recommendations on how to analyze and present data from scoping reviews [], 3 reviewers (SG, EP, and OE) grouped them by common topics. This produced a final list of 9 topics or reasons (). These were presented and agreed with the rest of the review team. The most common reason reported for the study of patents (62 studies, 50%) [-,,,,,,,,,,,,-,,,,,,,,,,,-,,,-,-,-,,,,,,,,,-,-,,,,] was to investigate trends within specific fields, followed by those aiming to provide recommendations for future research, policy, and strategy development (25 studies, 20%) [,,,,,,,,,,,,,,,,,-,,,,,]. Two other frequent objectives were: the study of specific patented technologies (13 studies, 10%) [,,,,,,,,,,,,] and the identification of emerging technologies (13 studies, 10%) [,,,,,,,,,,,,]. Less frequent objectives included the development of methodological frameworks for patent analysis (4 studies) [,,,], analysis of global patent trends (3 studies) [,,], cross-sectoral applications of emerging technologies (1 study) [], summarize recent research in specific fields (1 study) [], analyze patent application pipeline (1 study) [], and develop patent research methods (1 study) [].
Table 1. Reasons for conducting a patent research study by themes.Patent analysis objectiveNumber of studiesIdentify and analyze patent trends62Provide future directions25Identify emerging technologies13Analyze patented technologies13Methodological framework4Compare global patent trends3Cross-sectoral applications1Summarize recent research1Analyze patent application pipeline1Develop patent research methods1Sources of Patent DataEighty-five studies [-,-,-,-,-,,,-,-,-,-,,,,-,,,-,,,,,-,,,,,,,,] used patents alone, whereas 39 studies [,,,,,,,,,,,,,,,-,,-,-,,,,,,,,,,,-] used various sources that included gray literature, research papers, and clinical trials. Two studies [,] did not report which patent database had been used. More than 27% (34/124) of papers described the use of more than 1 patent database. As such, across the 124 papers, there were 47 databases listed. presents the distribution of the top 10 named databases across all papers.
Table 2. Top 10 patent databases used in the included studies.Patent databaseNumber of papersDerwent Innovation30USPTO24Espacenet20The Lens20PatentScope13WIPO13EPO10Google Patents10Orbit6PatSeer5aUSPTO: United States Patent and Trademark Office.
bWIPO: World Intellectual Property Office.
cEPO: European Patent Office.
Time Horizons of Patent ScansOf the 124 included papers, 44 (35.5%) papers [,,,,,,-,-,,,,,,,,,,,,,,,,,-,,,,,,,,,,,,] did not report the time period searched during patent analysis or gave incomplete timelines with only an end date leaving 80 papers [-,-,,-,,,,,,,,,,-,,,,,,,,,,,,-,-,-,,-,,-,-,-] with completed data on time horizons.
The mean time length covered in those 80 papers was 24.6 years (SD 2.37) [-,-,-,,-,,,,,-,,,,,,,,-,,,,,-,,,,,,,,,,-,-,-,-,]. 7 papers [,,,,,,] searched a time period of over 50 years with the broadest time period searched being from 1900 to 2019 []. Around 19% (15/78) of papers with a reported time horizon searched a period between 3 months and 10 years. Up to 15% (12/78) of papers searched a 20-year period. It was common for authors to backdate searches to the start of the year 2000, although the rationale was not provided.
Methods and Automated Approaches UsedOf the 124 included papers, 43 (34.6%) studies [-,-,-,,,,] reported the use of automated or semiautomated approaches during patent analysis. In these 41 papers, there were 43 unique tools that were used for patent analysis. presents all tools categorized into whether the tool was available as an in-built analysis module by the patent database searched (IN-BUILT), a custom patent analysis script (CUSTOM), or a commercially offered tool (COMMERCIAL).
Figure 2. Reported tools used for automated or semiautomated analysis. Some studies mentioned more than 1 tool, a full account of each tool used and their frequency distribution across the 41 studies is available in . These tools were used mainly in 4 different methods: data mining (understood as the statistical technique of processing raw data in a structured form), text mining (the part of data mining which involves processing of text from documents) [], data transformation (the process of converting and cleaning raw data from one data source to meet the requirements of its new location) [], and network visualization (understood as the graphical representations of network devices, network metrics, and data flows) []. Network visualization was by far the most popular method, used in 50% of the tools described. The most popular tool for network visualization was the freely available software Gephi (Gephi Consortium), used in 7 separate studies [-,]. presents a breakdown of these tools classified by their type and the number of studies that used them.
Table 3. Full account of all named tools used for automated or semiautomated approaches for patent analysis from 41 papers that reported their use.ToolNumber of studies used inTool typeMethodReferenceR – tidyverse2CUSTOMNetwork visualization[,]R – patentr1CUSTOMNetwork visualization[]R – cowplot1CUSTOMNetwork visualization[]R – gridExtra1CUSTOMNetwork visualization[]R – lubridate1CUSTOMData transformation[]R – magrittr1CUSTOMData transformation[]R – linkcomm1CUSTOMNetwork visualization[]R – trend1CUSTOMData transformation[]Python – pandas3CUSTOMData transformation[,,]Python – patentpy2CUSTOMData transformation[,]Python – seaborn.clustermap1CUSTOMNetwork visualization[]Python-based Patent Enrichment Tool (PEMT)1CUSTOMData mining/data transformation[]Python – gensim1CUSTOMData transformation[]Python – matplotlib1CUSTOMNetwork visualization[]Latent Dirichlet Allocation algorithm1CUSTOMText mining[]Word2Vec model1CUSTOMNetwork visualization[]Term Frequency-Inverse Document Frequency (TF-IDF) model1CUSTOMText mining[]Generative Topographic Modeling1CUSTOMNetwork visualization/data transformation[]Gephi7COMMERCIALNetwork visualization[-,]VOSviewer4COMMERCIALNetwork visualization[,,,]Open Refine2COMMERCIALData transformation[,]Questel-Orbit2COMMERCIALData mining[,]Vantage Point2COMMERCIALText mining[,]UCINET 61COMMERCIALNetwork visualization[]Tableau1COMMERCIALNetwork visualization[]Patentics1COMMERCIALNetwork visualization[]Pajek1COMMERCIALNetwork visualization[]Orange-Data Mining1COMMERCIALText mining[]Cytoscape1COMMERCIALNetwork visualization[]Harzing’s Publish or Perish1COMMERCIALData mining[]ITG Insight1COMMERCIALText mining/network visualization[]OriginPro1COMMERCIALNetwork visualization[]Patsnap1COMMERCIALNetwork visualization[]PatentInspiration1COMMERCIALNetwork visualization[]NoteExpress1COMMERCIALData transformation[]Derwent: ThemeScape Map2IN-BUILTNetwork visualization[,]Derwent: Data Analyzer2IN-BUILTData transformation[,]Lens2IN-BUILTData mining[,]AcclaimIP1IN-BUILTData mining[]PatentScope1IN-BUILTText mining[]Derwent: Smart Search1IN-BUILTData transformation[]Derwent Innovation1IN-BUILTText mining[]EP full-text search1IN-BUILTData mining[]NICE Disease Classification MappingOf the 124 papers, 28 (22.6%) [-,-,-,-,-] did not sufficiently refer to any level of NICE classification. Between the remaining 96 papers [-], there were 177 specific mentions of NICE guidance categories, which are detailed below (). Overall, “Cancer” was the most prevalent NICE topic with 19.4% (19/96) of the 124 included studies reporting cancer patents as either the primary or secondary finding. Of the 16.1% of papers [,,,,,,,,,,,] that reported “Respiratory conditions,” 60% (9/15),,,,,,,, of these were specific to “COVID-19” or other coronaviruses.
Table 4. Specific mentions of NICE disease and other guidance topics as a percentage of 178 NICE guidance topics referenced in the results of 94 papers that reported specific NICE topics in their results.NICE guidance (first level)Number of mentionsPercentage of papersCancer2419.4Infections2016.1Respiratory conditions1915.3Diabetes and other endocrine, nutritional, and metabolic conditions118.9Injuries, accidents, and wounds118.9Neurological conditions118.9Blood and immune system conditions108.1Cardiovascular conditions97.3Oral and dental health97.3Skin conditions86.5Health and social care delivery86.5Digestive tract conditions64.8Mental health, behavioral, and neurodevelopmental disorders64.8Eye conditions54Musculoskeletal conditions54Urological conditions43.2Chronic and neuropathic pain21.6Liver conditions21.6Sleep and sleep conditions21.6Ear, nose, and throat conditions10.8Fertility, pregnancy, and childbirth10.8Lifestyle and well-being10.8Not reported2822.6aNICE: National Institute for Health and Care Excellence.
Our rapid scoping review aimed to explore the recent literature on patent research studies to identify patent sources, tools, and methods used to understand how patent analyses may support horizon scanning methods in the detection of early innovations. To our knowledge, this is the first rapid scoping review focused on methodological approaches for patent research.
This review has identified several key findings that may be used to inform horizon scanning methods. First, our review has revealed an array of methodological approaches, tools, and reasons why patents are being used to discover signs of innovation. We identified 47 different sources for patent data retrieval with some studies using more than one source per study; for example, clinical trials and published or unpublished literature. The compilation of these sources in this review provides an up-to-date published list of patent data sources and insights into how researchers triangulate patent data with information on how, or whether, the innovation is being used in clinical studies and emerging research. This is an important finding for horizon scanning projects in general, and those undertaken by the IO, with particular value for the prioritization of scalable innovation. Patents alone may not be enough to assess the scalability of innovations in health care services such as the UK NHS. Combining patent data with clinical trials, publications, and clinical development analysis offers a more meaningful approach to evaluating technology adoption potential.
Second, this study showed that patent data are mostly used in trend analysis, perhaps due to the accessibility to retrospective global patent data. This finding aligns with what we already know about methods used for scanning emerging technologies and predicting innovations [,]. However, this review revealed a less common but notable application of using patent analyses to guide future policy, research, and strategy, similar to the “bottom-up” approach of the European Commission’s Joint Research Centre for detecting weak signals []. Most importantly, this approach can also reveal research gaps, thereby aiding in directing future funding and policies to address these gaps, thus contributing to eliminating health inequities and disparities. These findings will contribute to consolidating the IO’s methodological application of patent trend analysis studies to underpin “the push” or “bottom-up approach” to informing UK national future policy, research, or strategy on emerging innovation trends or identified innovation gaps.
Third, from our mapping of disease areas, not surprisingly, cancer, respiratory conditions, and infections were the top 3 topics addressed by the included studies. Considering that this rapid scoping review limited the included studies to only those published between 2020 and 2023, it was expected that COVID or, more broadly, respiratory conditions appeared in our dataset quite prominently. However, the focus on cancer technologies may respond to the growing burden of cancer worldwide and its impact on underserved populations []. Likewise, the emphasis on infection topics may respond to the antimicrobial resistance crisis that we are currently facing as well as the challenges posed by the growth of communicable diseases worldwide that continue to be the cause of substantial morbidity and mortality []. The alignment with real-world health challenges highlights innovators’ responses and reinforces the need for horizon scanning systems to adopt these methods for studying innovation in major health issues.
Fourth, the analysis showed that tools used for patent data analysis primarily focus on creating graphs and network visualizations, with examples including Gephi [], VOSViewer (Leiden University) [], and ThemeScape Map (Derwent Innovation, Clarivate) []. Data manipulation tools such as Pandas (Wes McKinney in Python) [] also played a key role. In addition, Python and R were the most common programming languages used to develop custom tools. Findings from this review highlighted that data transformation and visualization are methods often used for detecting emerging trends. This finding is in line with previously published studies into patent landscaping in the field of life sciences innovation []. Since there are no established methodological frameworks for analyzing patent data in horizon scanning, based on these results, it would be reasonable to assume that whenever tools are used for patent data analysis, some level of data transformation (or curation) is needed and therefore this step and the tools needed to undertake it should be part of the repertoire of methods for any horizon scanning practitioner and, more specifically, for the IO’s methods toolkit.
Finally, our review identified a wide range of technologies for which patent analyses were undertaken (, ). Due to the short time frame of our analysis, we could not establish a methodological link between analysis methods and technology types. A longer time frame would have yielded more historical data that could have allowed some trend analysis on patent study methods by technology type. Notwithstanding, our results suggest the broad applicability of patent studies to any type of technology.
Our study has some limitations. First, we imposed a short time frame for our bibliographic database searches, which may have meant that we missed some relevant methodological studies. However, this was a pragmatic decision justified by the rapid nature of this review. Given the aims and objectives of our review, which consisted of finding methods and sources for patent landscape analysis studies, we do not think that limiting the search to the last 3 years of publication has biased our methodological conclusions; if anything, it has provided the most up-to-date view on tools and methods. Second, a longer time period would have provided more data allowing us to study the link between technology types and methods providing a more holistic view of patent analysis methods. Third, we used single screening to select studies, so we did not assess interrater agreement. While this practical choice may have led to some relevant studies being missed, we believe it would not have significantly affected our overall conclusions. Finally, our review was not able to ascertain if deduplication of data, a critical step to prevent data inflation and ensure accurate analyses of innovation clusters, was addressed by the included studies [].
ConclusionsOur findings highlight a broad range of methodological approaches and tools which may suggest that in the health care technology context, the continuous search for signs of innovation is driving methods and tools development. Our study has shed light on the usefulness and purpose of patent research studies in the context of health care technology innovation, but it has not solved some methodological aspects such as deduplication processes and possible links between technology types and methods. Notwithstanding, when conducting such studies, particular attention should be given to the processes of data manipulation, transformation, and deduplication—especially when integrating multiple data sources—and these methods should be transparently documented and reported in the final output to ensure reproducibility and data quality. Implementing existing guidelines for reporting items for patent landscapes is also strongly recommended [,].
The IO aims to provide a systematic approach to horizon scanning across the 3 horizons of health care technology development: emerging, transitional, and imminent. The findings from this study will support the practice of horizon scanning in the emerging technology horizon. In summary, we expect that the findings from this rapid scoping review would be of value to a wide range of horizon scanning practitioners as well as those planning to undertake patent research studies in the field of health care innovation.
This study would not have been possible without the support of the Innovation Observatory. We thank Oleta Williams for the Information Specialist support provided for scoping and running the bibliographic databases searches for this rapid scoping review. This project is funded by the National Institute for Health and Care Research (NIHR) (HSRIC-2016‐10009/Innovation Observatory).
The datasets generated or analyzed during this study are available from the corresponding author on reasonable request.
SGGM and AM contributed to conceptualization, methodology, supervision, and project administration. SGGM, ER, OE, EG, JE, and AM were involved in data curation, formal analysis, validation, and visualization. All coauthors participated in writing – original draft, writing – review and editing, and approval of the final manuscript.
None declared.
Edited by Taiane de Azevedo Cardoso; submitted 19.12.24; peer-reviewed by I-Chun Lai, James Kenneth Timmis, M Pruski; final revised version received 27.05.25; accepted 28.05.25; published 11.09.25.
© Sonia Garcia Gonzalez-Moral, Erin Pennock, Olushola Ewedairo, Elizabeth Green, James Elgey, Andrew Mkwashi. Originally published in the Interactive Journal of Medical Research (https://www.i-jmr.org/), 11.9.2025.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Interactive Journal of Medical Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.i-jmr.org/, as well as this copyright and license information must be included.
Comments (0)