Presentation is loading. Please wait.

Presentation is loading. Please wait.

Anna Bogomolova, Tatyana N. Yudina, Oleg Karasev, Ruslan Sennov University Information System RUSSIA: RF Social and Budget Statistics Modules with Research-assisting.

Similar presentations


Presentation on theme: "Anna Bogomolova, Tatyana N. Yudina, Oleg Karasev, Ruslan Sennov University Information System RUSSIA: RF Social and Budget Statistics Modules with Research-assisting."— Presentation transcript:

1 Anna Bogomolova, Tatyana N. Yudina, Oleg Karasev, Ruslan Sennov University Information System RUSSIA: RF Social and Budget Statistics Modules with Research-assisting Services. System of Subject Headings to Cross-Search Data and Documents on Public Finances Research Computing Center of Moscow State University NCO Center for Information Research

2 SourceRetrospectiveDocuments Government documents/ State agencies Publication /Kodeks Law Firm 1990-…55,000 +700,000 State Duma daily records State Duma1994-…150,000 State StatisticsState Statistics Agency; RF Ministries, CIS Interstate Statistics Committee 1998-…55,000 Mass mediaExpert weekly; Nezavis. gazeta; Izvestia; … 199(7)-…400,000 State agencies analytical reports, Think tanks reports Central Bank of RF; Russian-European Center for Economic Policy; Fiscal Policy Center 1996-…20,000 Academic publications MSU Bulletins, Economic Forecasting, Sociology Research, Law… 1999-…3,000 +230,000ref +40,000 in English University Information System RUSSIA Collections 1 5 University Information System RUSSIA Collections 1 500,000/ 17.5Gb (www.cir.ru)

3

4 Sociopolitical Thesaurus 70,000 concepts, 110,000 conceptual relations  constructed specially as a tool for automatic text processing;  contains terms from economic, financial, political, military, social, legislative and cultural domains;  a set of relations is adapted to information-retrieval applications;  regularly tested during automatic text processing

5 THESAURUS for Information Retrieval in Sociopolitical Domain  Thesaurus provides for query refinement - reformulation/expansion  Terminology of Thesaurus covers 95-98% of words and terms of Russian government publications, academic papers and mass media texts from 1991  Thesaurus is a main element of ALTP/automatic linguistic text processing technology.

6 Query Refinement

7 Thematic modules University Information System RUSSIA includes:  Module of Socioeconomic State Statistics of Russia  Budget Statistics Module  Module of documents of the European Court of Human Rights

8

9 System of Subject Headings for Budget Data 87 hierarchic categories First level categories are:  Macroeconomic Indicators  Budget Revenues and Expenditures  Tax Concessions  Budget Deficit/Surplus  State and Municipal Debt  Budget Process  Budget Federalism  Extra-Budgetary Funds  State Authorities  Fiscal Misconduct

10 Category Description “Tariffs of Natural Monopolies”  Tariffs & natural monopoly  Tariffs & (gas or electricity or housing and public utilities or railway service)  Tariffs & (Unified Energy System of Russia or Gasprom)

11 Further developments  Including microdata  Developing and testing of budget thesaurus  Developing databases of socioeconomic and budgetary statistics


Download ppt "Anna Bogomolova, Tatyana N. Yudina, Oleg Karasev, Ruslan Sennov University Information System RUSSIA: RF Social and Budget Statistics Modules with Research-assisting."

Similar presentations


Ads by Google