1
Anna Bogomolova, Tatyana N. Yudina, Oleg Karasev, Ruslan Sennov
University Information System RUSSIA: RF Social and Budget Statistics Modules with Research-assisting Services. System of Subject Headings to Cross-Search Data and Documents on Public Finances
Research Computing Center of Moscow State University
NCO Center for Information Research
2
University Information System RUSSIA Collections: 1,500,000 / 17.5 Gb (www.cir.ru)
Collection | Source | Retrospective | Documents
Government documents / State agencies | Publication / Kodeks Law Firm | 1990-… | 55,000 + 700,000
State Duma daily records | State Duma | 1994-… | 150,000
State Statistics | State Statistics Agency; RF Ministries; CIS Interstate Statistics Committee | 1998-… | 55,000
Mass media | Expert weekly; Nezavis. gazeta; Izvestia; … | 199(7)-… | 400,000
State agencies analytical reports, Think tanks reports | Central Bank of RF; Russian-European Center for Economic Policy; Fiscal Policy Center | 1996-… | 20,000
Academic publications | MSU Bulletins, Economic Forecasting, Sociology Research, Law… | 1999-… | 3,000 + 230,000 ref + 40,000 in English
4
Sociopolitical Thesaurus
70,000 concepts, 110,000 conceptual relations
constructed specially as a tool for automatic text processing
contains terms from economic, financial, political, military, social, legislative and cultural domains
a set of relations is adapted to information-retrieval applications
regularly tested during automatic text processing
5
THESAURUS for Information Retrieval in Sociopolitical Domain
The thesaurus supports query refinement (reformulation and expansion).
The thesaurus terminology covers 95-98% of the words and terms in Russian government publications, academic papers and mass media texts from 1991.
The thesaurus is the main element of the ALTP (automatic linguistic text processing) technology.
6
Query Refinement
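As a rough illustration of thesaurus-based query expansion, the sketch below adds synonyms and narrower terms to the words of a query. The toy thesaurus entries, the relation names `synonyms` and `narrower`, and the function `expand_query` are all assumptions made for this example; they are not taken from the Sociopolitical Thesaurus or the ALTP technology.

```python
# Toy thesaurus: every entry and relation name here is invented for
# illustration and does not come from the Sociopolitical Thesaurus.
TOY_THESAURUS = {
    "tariff": {
        "synonyms": ["rate"],
        "narrower": ["electricity tariff", "gas tariff"],
    },
    "natural monopoly": {
        "synonyms": [],
        "narrower": ["railway service"],
    },
}


def expand_query(terms, thesaurus, include_narrower=True):
    """Return the query terms plus related thesaurus terms, without duplicates.

    Terms are lowercased; terms missing from the thesaurus are kept as-is.
    """
    expanded = []
    seen = set()
    for term in terms:
        key = term.lower()
        entry = thesaurus.get(key, {})
        candidates = [key] + entry.get("synonyms", [])
        if include_narrower:
            candidates += entry.get("narrower", [])
        for candidate in candidates:
            if candidate not in seen:
                seen.add(candidate)
                expanded.append(candidate)
    return expanded


if __name__ == "__main__":
    print(expand_query(["Tariff", "natural monopoly"], TOY_THESAURUS))
```

Setting `include_narrower=False` limits the expansion to synonyms, which corresponds to reformulating a query without broadening it.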
7
Thematic modules
University Information System RUSSIA includes:
Module of Socioeconomic State Statistics of Russia
Budget Statistics Module
Module of documents of the European Court of Human Rights
9
System of Subject Headings for Budget Data
87 hierarchical categories
First-level categories are:
Macroeconomic Indicators
Budget Revenues and Expenditures
Tax Concessions
Budget Deficit/Surplus
State and Municipal Debt
Budget Process
Budget Federalism
Extra-Budgetary Funds
State Authorities
Fiscal Misconduct
10
Category Description “Tariffs of Natural Monopolies”
Tariffs & natural monopoly
Tariffs & (gas or electricity or housing and public utilities or railway service)
Tariffs & (Unified Energy System of Russia or Gazprom)
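The category description above reads as a set of Boolean expressions. A minimal sketch of how such a description could be evaluated against a text follows. It assumes that a document belongs to the category when any one of the three expressions holds, and it matches terms by plain lowercase substring search; the actual system presumably matches thesaurus concepts rather than raw strings. The names `TARIFFS_OF_NATURAL_MONOPOLIES` and `matches_category` are hypothetical.

```python
# Each rule is an AND of groups; each group is an OR of alternative terms.
# "housing and public utilities" is treated as one term, not as an AND.
TARIFFS_OF_NATURAL_MONOPOLIES = [
    [["tariffs"], ["natural monopoly"]],
    [["tariffs"], ["gas", "electricity",
                   "housing and public utilities", "railway service"]],
    [["tariffs"], ["unified energy system of russia", "gazprom"]],
]


def matches_category(text, rules):
    """Return True if the text satisfies at least one rule.

    Assumption: the rules are combined with OR. Matching is a naive
    substring test, so "gas" would also match "gasoline".
    """
    lowered = text.lower()
    return any(
        all(any(term in lowered for term in group) for group in rule)
        for rule in rules
    )


if __name__ == "__main__":
    sample = "The regulator approved new tariffs for electricity."
    print(matches_category(sample, TARIFFS_OF_NATURAL_MONOPOLIES))
```

Keeping each description as data rather than code means that all 87 categories could share one evaluation function, with one rule list per subject heading.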
11
Further developments
Including microdata
Developing and testing of budget thesaurus
Developing databases of socioeconomic and budgetary statistics