Title: Anna Bogomolova, Tatyana N' Yudina, Oleg Karasev, Ruslan Sennov
1Research Computing Center of Moscow State
University NCO Center for Information Research
Anna Bogomolova, Tatyana N. Yudina, Oleg
Karasev, Ruslan Sennov University Information
System RUSSIA RF Social and Budget Statistics
Modules with Research-assisting Services. System
of Subject Headings to Cross-Search Data and
Documents on Public Finances
2University Information System RUSSIA Collections
1 500,000/ 17.5Gb (www.cir.ru)
3(No Transcript)
4Sociopolitical Thesaurus
70,000 concepts, 110,000 conceptual
relations
- constructed specially as a tool for
automatic text processing - contains terms from economic, financial,
political, military, social, legislative
and cultural domains - a set of relations is adapted to
information-retrieval applications - regularly tested during automatic text
processing
5THESAURUS for Information Retrievalin
Sociopolitical Domain
- Thesaurus provides for query refinement -
reformulation/expansion - Terminology of Thesaurus covers 95-98 of
words and terms of Russian government
publications, academic papers and mass media
texts from 1991 - Thesaurus is a main element of ALTP/automatic
linguistic text processing technology.
6Query Refinement
7Thematic modules
- University Information System RUSSIA includes
- Module of Socioeconomic State Statistics of
Russia - Budget Statistics Module
- Module of documents of the European Court of
Human Rights
8(No Transcript)
9System of Subject Headings for Budget Data
- 87 hierarchic categories
- First level categories are
- Macroeconomic Indicators
- Budget Revenues and Expenditures
- Tax Concessions
- Budget Deficit/Surplus
- State and Municipal Debt
- Budget Process
- Budget Federalism
- Extra-Budgetary Funds
- State Authorities
- Fiscal Misconduct
10Category DescriptionTariffs of Natural
Monopolies
- Tariffs natural monopoly
- Tariffs (gas or electricity or housing and
public utilities or railway service) - Tariffs (Unified Energy System of Russia or
Gasprom)
11Further developments
- Including microdata
- Developing and testing of budget thesaurus
- Developing databases of socioeconomic and
budgetary statistics -