Patent analytics based on feature Vector Space Model: a case of IoT

Lei, Lei, Qi, Jiaju, and Zheng, Kan (2019) Patent analytics based on feature Vector Space Model: a case of IoT. IEEE Access, 7. pp. 45705-45715.

[img] PDF (Published Version) - Published Version
Restricted to Repository staff only

View at Publisher Website: https://doi.org/10.1109/ACCESS.2019.2909...
22


Abstract

The number of approved patents worldwide increases rapidly each year, which requires new patent analytics to efficiently mine the valuable information attached to these patents. The vector space model (VSM) represents documents as high-dimensional vectors, where each dimension corresponds to a unique term. While originally proposed for information retrieval systems, VSM has also seen wide applications in patent analytics and is used as a fundamental tool to map patent documents to structured data. However, the VSM method suffers from several limitations when applied to patent analysis tasks, such as loss of sentence-level semantics and curse-of-dimensionality problems. In order to address the above limitations, we propose patent analytics based on feature VSM (FVSM), where the FVSM is constructed by mapping patent documents to feature the vectors extracted by convolutional neural networks (CNNs). The applications of FVSM for three typical patent analysis tasks, i.e., patents similarity comparison, patent clustering, and patent map generation, are discussed. A case study using patents related to the Internet of Things (IoT) technology is illustrated to demonstrate the performance and effectiveness of FVSM. The proposed FVSM can be adopted by other patent analysis studies to replace VSM, based on which various big data learning tasks can be performed.

Item ID: 58291
Item Type: Article (Research - C1)
ISSN: 2169-3536
Keywords: CNN, IoT, patent analysis, VSM
Date Deposited: 15 May 2019 07:52
FoR Codes: 40 ENGINEERING > 4006 Communications engineering > 400608 Wireless communication systems and technologies (incl. microwave and millimetrewave) @ 20%
46 INFORMATION AND COMPUTING SCIENCES > 4602 Artificial intelligence > 460208 Natural language processing @ 80%
SEO Codes: 89 INFORMATION AND COMMUNICATION SERVICES > 8901 Communication Networks and Services > 890103 Mobile Data Networks and Services @ 100%
More Statistics

Actions (Repository Staff Only)

Item Control Page Item Control Page