Download presentation
Presentation is loading. Please wait.
Published byLesley Russell Modified over 9 years ago
1
Patent Citation Networks Bernard Gress http://student.ucr.edu/~gressb01 Fannie Mae Inc., Washington DC. bernard_gress@fanniemae.com Forthcoming in The Mathematica Journal http://www.mathematica-journal.com/
2
The Patent Citation Dataset Patent citations are part of the legal patent process where the patent applicant has the duty to disclose any knowledge of 'prior art' amongst previous patents. Some objectivity in the process is provided by the government patent examiner who is supposed to be an expert in the area and who approves the final citation. The network established by patent citations allows one to trace the flow of technology through time, from patent to patent, and across fields. Studies of technological spillover effects, the impact or influence of individual patents, the rates of technological development, and other such issues, can be assisted by the consideration of patent citations.
3
The Patent Citation Dataset - continued Hall, Jaffe, and Trajtenberg, and the National Bureau of Economic Research (NBER) (http://www.nber.org/patents/). The primary database (cite75_99.zip) contains 22,309,440 pairs of pair-wise patent citation dataset on more than 3 million U.S. patents granted between January 1963 and December 2002. The secondary database (pat63_02f.txt) contains records for 3,414,910 patents with 25 fields each.
4
Structure of Primary Database (cite75_99.zip)
5
Structure of Secondary Database (pat63_02f.txt)
6
Patent Numbers Issued Serially
7
Two Types of Citation Networks A Citation Lineage –all of the progenitors and descendants by citation reference, so long as no siblings are brought into the picture A Citation Neighborhood –all those patents that are within a specified network distance of the patent of interest, regardless of relationship, including all 'siblings' and 'cousins'.
8
There are 14 nodes for the 1- generation lineage of patent #3858382: PatentLineage[3858382,1] –PatentsOfInterest ® {3858382}, –PrintRules ® {1 ® 3858382, 2 ® 1794517, 3 ® 2045678, 4 ® 2069266, 5 ® 2790591, 6 ® 3044233, 7 ® 3100569, 8 ® 3468100, 9 ® 3646723, 10 ® 4085822, 11 ® 4316353, 12 ® 4750694, 13 ® 4863125, 14 ® 5054646, 15 ® 6250501} –Relations ® {3858382 ® 4085822, 3858382 ® 4316353, 3858382 ® 4750694, 3858382 ® 4863125, 3858382 ® 5054646, 3858382 ® 6250501, 1794517 ® 3858382, 2045678 ® 3858382, 2069266 ® 3858382, 2790591 ® 3858382, 3044233 ® 3858382, 3100569 ® 3858382, 3468100 ® 3858382, 3646723 ® 3858382} –Vertexes ® {3858382, 1794517, 2045678,2069266, 2790591, 3044233, 3100569, 3468100, 3646723, 4085822, 4316353, 4750694, 4863125, 5054646, 6250501} –IndexPairs ® {{1,10},{1,11},{1,12}, {1,13},{1,14}, {1,15}, {2,1},{3,1}, {4,1},{5,1},{6,1},{7,1}, {8,1},{9,1}} –IndexRules ® {1 ® 10, 1 ® 11, 1 ® 12, 1 ® 13, 1 ® 14, 1 ® 15, 2 ® 1, 3 ® 1, 4 ® 1, 5 ® 1, 6 ® 1, 7 ® 1, 8 ® 1, 9 ® 1}
9
There are 15 nodes for the 1-generation Neighborhood of patent #3858382: PatentNeighborhood[3858382,1] –PatentsOfInterest ® {3858382} –PrintRules ® {1 ® 3858382, 2 ® 1794517, 3 ® 2045678, 4 ® 2069266, 5 ® 2790591, 6 ® 3044233, 7 ® 3100569, 8 ® 3468100, 9 ® 3646723, 10 ® 4085822, 11 ® 4316353, 12 ® 4750694, 13 ® 4863125, 14 ® 5054646, 15 ® 6250501} –Relations ® {1794517 ® 3858382, 2045678 ® 3858382, 2069266 ® 3858382, 2790591 ® 3858382, 3044233 ® 3858382, 3100569 ® 3858382, 3468100 ® 3858382, 3646723 ® 3858382, 3858382 ® 4085822, 3858382 ® 4316353, 3858382 ® 4750694, 3858382 ® 4863125, 3858382 ® 5054646, 3858382 ® 6250501} –Vertexes ® {3858382, 1794517, 2045678, 2069266,2790591, 3044233, 3100569, 3468100, 3646723, 4085822, 4316353, 4750694, 4863125, 5054646, 6250501} –IndexPairs ® {{1,10}, {1,11}, {1,12}, {1,13}, {1,14}, {1,15}, {2,1}, {3,1},{4,1},{5,1},{6,1}, {7,1}, {8,1}, {9,1}} –IndexRules ® {1 ® 10, 1 ® 11, 1 ® 12,1 ® 13, 1 ® 14, 1 ® 15, 2 ® 1, 3 ® 1, 4 ® 1, 5 ® 1, 6 ® 1, 7 ® 1, 8 ® 1, 9 ® 1}
10
Mathematica has Nice Built-in Graph Visualization Functions for Unstructured Graphs: GraphPlot GraphPlot3D ShowGraph But to Plot Graphs Over Time then Have to Use My Function: PatentPlot
11
Citation Networks Over Time - continued The 2-Generation Lineage of 3858382
12
Citation Networks Over Time - continued The 2-Generation Neighborhood of 3858382
13
GraphPlot[PatentNeighborHood[ {3858382, 4597749}, 2]]
14
A nice illustration of the spread of technology over time.
15
Coloring nodes by criteria I also add functions to color nodes and edges by different patent characteristics, e.g. –Patent Technology Category (2- and 4-digit HJT) –Patent Originality/ Generality Index –Total Number of Citations
16
GraphPlot3D[PatentNeighborhood[ 3858382, 7]]
17
GraphPlot[PatentNeighborhood[3858382,12]] Colored by technology category
18
Time Constrained The 7-Generation Neighborhood of #3858382, Colored by Technology Class
19
Network Statistics and Structure Analysis Citation Lags Network Curvature Citation Count Distributions HJT Technology Categories Originality and Generality
20
Distributions of Backward Lags
22
Network Curvature the average number of patents reached at subsequent network distances -some simple graphs and their respective curvature plots-
23
Network Curvature the average number of patents reached at subsequent network distances
24
A much larger network of 91,000 patents over 40 years Curvature graphs for each year
25
Curvature graphs for each year, all together
26
Curvature graphs for each year, all together, different view
27
Patent Technological Composition
28
HJT Technology Category Distribution
29
Cumulative distribution of patents by tech category
30
Citation Count Distributions
31
Citation Count Distributions - continued
34
Generality and Originality where J is the number of patent classes, N i is the total number of forward citations for patent i, and N i,j is the number of forward citations in each patent class for patent i. The second term is a Herfindal-type of index. The 'Originality' of Patent 'i' is the same, except with backwards citations (i.e. citations made). "Thus if a patent cites previous patents that belong to a narrow set of technologies, the originality score will be low, whereas Citing patents in a wide range of fields would render a higher score."
35
Generality and Originality - Continued Not very interesting - at least no trends over time – and seemingly no necessary relationship to the concepts they intend to measure.
36
Conclusions Mathematica is a nice platform for networks analysis There is a lot of opportunity for research in this area Don’t know what the value of this research is to the IPI-ConfEx clientele
37
References [1] B. Hall, Jaffe, Trajtenberg, "The NBER Patent Citations Data File: Lessons, Insights and Methodological Tools," 2002, http://emlab.berkeley.edu/users/bhhall/pat/ NBERpatdata.pdf http://emlab.berkeley.edu/users/bhhall/pat/ NBERpatdata.pdf [2] S. Wolfram, A New Kind of Science, : 2002
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.