Download presentation
Presentation is loading. Please wait.
Published byMarlene Nichols Modified over 8 years ago
1
REGNET Gloria Lau, Haoyi Wang, Kincho Law, Gio Wiederhold Stanford University May 16th, 2005 A Relatedness Analysis Approach for Regulation Comparison and E-Rulemaking Applications
2
1 Motivation: regulatory comparison Multiple sources of regulations Multiple jurisdictions: federal, state, local, etc. Different formats, terminologies, contexts UK DDA in HTMLADAAG in HTML Amending rules, conflicting ideas IBC in PDF
3
2 Increasing amount of electronic data in e-rulemaking Example Alcohol and Tobacco Tax and Trade Bureau received over 14,000 comments in 7 months, the majority of which were emails, on a flavored malt beverages proposal Originally in the Federal Register: “All comments posted on our Web site will show the name of the commenter but will not show street addresses, telephone numbers, or e-mail addresses.” Later in the Federal Register: due to the “unusually large number of comments received,” the Bureau later announced that it was difficult to remove all street addresses, telephone numbers and email addresses “in a timely manner.” Motivation: e-rulemaking
4
3 Relatedness analysis based on a regulatory repository XML regulatory repository with features extracted Shallow parser to consolidate regulations HTML, PDF, plain text XML regulations Features, references, etc. Relatedness analysis to help understanding of regulations and the relationships between them Feature matching Structural matching Application to e-rulemaking Comparisons of drafted regulations and public comments
5
4 Development of a Regulatory Repository
6
5 reference parse tree Feature Extraction in XML … … Assembly areas with fixed seating shall comply …...
7
6 Structural comparisons Related elements: door and entrance Relatedness analysis ADAAG 4.1.6(3)(d) Doors (i) Where it is technically infeasible to comply with clear opening width requirements of 4.13.5, a projection... UFAS 4.14.1 Minimum Number Entrances required to be accessible by 4.1 shall be part of an accessible route and shall comply with...
8
7 Relatedness analysis To utilize the computational properties of regulations for a complete comparison Measure Degree of relatedness: similarity score f (A, U) (0, 1) Nodes A and U are provisions from two different regulation trees
9
8 Base score f 0 computation Linear combination of feature matching F ( A, U, i ) = similarity score between Sections ( A, U ) based on feature i N = total number of features = weighting coefficient Feature matching Based on the Vector model using cosine similarity as the distance between feature vectors Non-Boolean features A measurement of “2 inches max” can be a 70% match to “2 inches” Synonyms exist, e.g., ontology defined for chemicals Perform vector-space transformation prior to cosine computation
10
9 Score refinements based on regulation structure Neighbor inclusion Diffusion of similarity between clusters of nodes in the tree
11
10 Score refinements based on regulation structure Reference distribution Diffusion of similarity between referencing nodes and referenced nodes in the tree E.g., f (A5.3, U6.4(a)) updates f (A2.1, U3.3)
12
11 Performance evaluation Conduct a user survey of rankings of similarity 10 randomly chosen sections from the ADAAG and UFAS Ranks 1 to 100 in the order of relevance Root mean square error ( RMSE ) = user-generated ranking vector = machine-predicted ranking vector
13
12 Survey results - Tabulated RMSE’s Compared our analysis to Latent Semantic Indexing (LSI) = structural weighting coefficient = feature weighting coefficient Average RMSE smaller than LSI Measurement feature performs best No improvement in result observed for structural comparison
14
13 Results of comparisons: ADAAG vs. UFAS Related accessible elements: door and entrance No ontological information Neighbor inclusion reveals higher similarity Content of neighbors imply similarity between Section 4.1.6(3)(d) in ADAAG and Section 4.14.1 in UFAS
15
14 Results of comparisons : UFAS vs. BS8300 Terminological differences - revealed through neighbor inclusion
16
15 Results of comparisons : UFAS vs. Scottish Technical Standards Terminological differences - revealed through reference distribution Stairs and ramps
17
16 Application domain: e-rulemaking Comparison between draft of rules and the associated public comments ADAAG Chapter 11, rights-of-way draft Less than 15 pages Over 1400 public comments received within 4 months Comments ~ 10MB in size; most are several pages long New regulation draft can easily generate a huge amount of data that needs to be reviewed and analyzed Parsing of the draft and comments From HTML to XML Recreate structure of the draft using our shallow parser Extract features from the draft and comments Treat individual comments as provisions Application to e-rulemaking
18
17 Application to E-Rulemaking Drafted regulations compared with public comments
19
18 Related section in draft and public comment Results from e-rulemaking application
20
19 Results from e-rulemaking application No related provisions identified Concern not addressed in the draft
21
20 Results from e-rulemaking application Related section in draft and public comment Commenting per provision Forward to right personnel
22
21 Results from e-rulemaking application Related section in draft and public comment Suggested revision cannot be located automatically Linguistic analysis can potentially help
23
22 Results from e-rulemaking application Comment on the general intent of the draft Clustering of comments might help
24
23 Conclusions Prototype for relatedness comparisons of regulations Contextual comparisons Domain knowledge Structural comparisons Performance Evaluation, Results and Applications User survey and comparisons with LSI Observations of comparisons between Federal, State, non-profit organization mandated codes and European standards Application to e-rulemaking Compare drafted rules with public comments Observations of comparisons based on a rights-of-way draft
25
24 Future research directions Regulatory comparison Regulatory competition Cross border data transfer laws Especially in the polyglot countries in EU Regulatory updates Track changes in updates Track cross references between regulations E-rulemaking Automated routing of comment to person in charge Clustering of comments Web portal for comment submission per provision, in addition to per draft Linguistic analysis to match patterns of suggested revision embedded in comments
26
25 Thank You!
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.