Download presentation
Presentation is loading. Please wait.
Published byMarsha Webster Modified over 9 years ago
1
Faceted browsing for ACL Anthology Praveen Bysani
2
ACL Anthology a digital archive of research papers in CL and NLP contains over 20,100 papers free of cost archive for sister conferences and journals
3
Current browser direct and navigational search hard to navigate non-customized search non-sortable results
4
Faceted browsing Combination of navigational and direct search paradigms Facets are properties of information elements Access to organized information Ability to explore the collection in multiple dimensions through filters
5
Faceted Browsing RoR + Blacklight plugin Apache Solr Metadata from XML Blacklight customization for XML
6
Show view
7
Index View
8
More cookies.. User Feedback Comment/ Share / Like Suggestions for correcting the meta data Ability to export bib in six formats Author pages List of publications Co-authors
9
Third-party annotations Automatically annotate articles with new metadata Anthology as a corpus API to make anthology an object of study OAI compatible allows metadata harvesting @ http://aclanthology.heroku.com/
10
Challenges Normalizing the quality of anthology meta data information SIG Information yaml files no identifiers provided DOI from acm changes in names of papers, authors
11
Similar works ACL Author Network bibliometrics ACL Search Bench Semantic search
12
Plans for the future A common data schema to integrate all Indexing the whole text data Range queries for year facet Exporting total volume bibliography Enriching author pages
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.