• Main Navigation
  • Main Content
  • Sidebar

Russian Digital Libraries Journal

  • Home
  • About
    • About the Journal
    • Aims and Scopes
    • Themes
    • Editor-in-Chief
    • Editorial Team
    • Submissions
    • Open Access Statement
    • Privacy Statement
    • Contact
  • Current
  • Archives
  • Register
  • Login
  • Search
Published since 1998
ISSN 1562-5419
16+
Language
  • Русский
  • English

Search

Advanced filters

Search Results

Science Data Infrastructure for Access to Earth Observation Satellite Data

Е.Б. Кудашев
Abstract: Virtual research centre of digital preservation in Europe provides a natural basis for long-term consolidation of digital preservation research and expertise. Spatial Data Infrastructure will cover technical methods for preservation, access and most importantly re-use of data holdings over the whole lifecycle; legal and economic issues including costs and governance issues as well as digital rights; and outreach within and outside the consortium to help to create a discipline of data curators with appropriate qualifications. Main tasks of Spatial Data Infrastructure SDI development are building global infrastructure for IT and geodata; satellite information harmonization; usage of agreed upon set of standards; clear documentation describing the parts of the system; interoperability between independently created applications and databases; common standards within their interfaces, protocols and data formats; and finally support of a general data policy for data creation, access, and support of satellite information. Fundamental principle of Russian segment of SDI is providing interoperability – the ability of interaction for heterogeneous services and data catalogues within the bounds of a unified informational system. The Russian segment of distributed informational system has been built on the basis of EOLI-XML and SSE technologies.
Keywords: Science Data Infrastructure, e-Science, Earth Observation data, Scientific e-Infrastructure, Open Data Infrastructure, Data management.

Подсистема проведения конференций и ее метаданные

А.Н. Алексеев, А.В. Созыкин, Г.Ф. Масич, А.Н. Бездушный
Abstract: Работа посвящена рассмотрению подсистемы поддержки проведения конференций системы "Научный институт РАН", требований к ней, возможностей использования технологий системы ИСИР РАН и предоставляемую ею функциональность (интеграция, поиск, репликация). Описана объектная схема данных подсистемы - ее метаданные.

Procedure for Comparing Text Recognition Software Solutions For Scientific Publications by the Quality of Metadata Extraction

Ilia Igorevich Kuznetsov , Oleg Panteleevich Novikov, Dmitry Yurievich ILIN
654-680
Abstract:

Metadata of scientific publications are used to build catalogs, determine the citation of publications, and perform other tasks. Automation of metadata extraction from PDF files provides means to speed up the execution of the designated tasks, while the possibility of further use of the obtained data depends on the quality of extraction. Existing software solutions were analyzed, after which three of them were selected: GROBID, CERMINE, ScientificPdfParser. A procedure for comparing software solutions for recognizing texts of scientific publications by the quality of metadata extraction is proposed. Based on the procedure, an experiment was conducted to extract 4 types of metadata (title, abstract, publication date, author names). To compare software solutions, a dataset of 112,457 publications divided into 23 subject areas formed on the basis of Semantic Scholar data was used. An example of choosing an effective software solution for metadata extraction under the conditions of specified priorities for subject areas and types of metadata using a weighted sum is given. It was determined that for the given example CERMINE shows efficiency 10.5% higher than GROBID and 9.6% higher than ScientificPdfParser.

Keywords: text recognition, scientific publications, metadata, data extraction quality, procedure.

Methodology and technology for creating of the multi-purposed information environment T-System based on the digital library with flexible full-text search

С.Х. Ляпин, А.В. Куковякин
Abstract: We describe hereby the methodology and technology for creating of the multi-purposed information environment ‘T System’ based on the extension of the digital library T-Libra. The environment is destined for the integration of resources and services, which a typical for digital library with flexible full-text search, virtual museum, digital archive, research laboratory and educational server. The methodological basis this sort of extension is a hybrid two-level ontology based on interaction of functional systems (top level), concepts library and thesauri library (lower level). The technological basis of extension is an administrative division, which has tools for flexible set-up of T-System’s, as well as an unified search system includes the mechanisms of nonlinear cascade inquiries, which of that generate a relevant functional systems and combine the results of full-text search, related thesauri and concepts, text metadata and non-text objects of different modalities (graphics, sound, video etc.). The above-mentioned environment is designed in three-tier architecture (Web-browser / Web-server + Application server / DB server) with using of special indexing system for increase of search effectiveness, as well as of external logic which is built-up in Application Server.

Analysis of the Distribution of Key Terms in Scientific Articles

Svetlana Aleksandrovna Vlasova, Nikolay Evgenievich Kalenov, Irina Nikolaevna Sobolevskaya
35-51
Abstract:

One of the Common Digital Space of Scientific Knowledge (CDSSK) main components are the subject ontologies of individual thematic subspaces, which include the basic concepts related to this scientific area. The constructing subject ontologies task at the initial phase requires the array of key terms formation in a given scientific are with the subsequent establishment of links between them. A similar task is in the encyclopedias formation in terms of the articles (slots) list generating that determines their content. One of the sources for the formation of the key terms array can be the metadata of articles published in the leading scientific journals. Namely, the author's key terms ("keywords" in the terminology of the journals editors) quoted by the article. To make a conclusion about the possibility of using this approach to the subject ontologies formation, it is necessary to conduct the author's key terms array preanalysis, both in terms of real correspondence to the main areas of research in this science branch and in terms of the distribution of the certain terms occurrence frequency. This article presents the results of the occurrence frequency analysis of the author's key terms in Russian and English, carried out on the software processing basis of several thousand articles from leading Russian journals in mathematics, computer science and physics, reflected in the MathNet database. An assessment was made of the distribution of key terms correspondence (as phrases) and individual words to the Bradford's law, and the key terms cores within the thematic direction were identified.

Keywords: digital space of scientific knowledge, subject ontologies, encyclopedia articles, key terms, article metadata, frequency analysis.

Metadata for describing collection of periodicals

А.Г. Абросимов
Abstract: Статья посвящена проблеме формирования метаданных коллекции периодической печати 19 – начала 20 веков, создаваемой в Научной библиотеке Казанского государственного университета (НБ КГУ) при поддержке Российского гуманитарного научного фонда (проект № 04-01-12032в).
Собрание местной периодической печати НБ КГУ является одним из самых полных, в нее входят практически все газеты, издававшиеся в Казани в 19 веке, цензорские экземпляры газет, в которых сохранились первые редактуры произведений А.М. Горького, В.Г. Короленко, Н.Г. Гарина-Михайловского и других. Собрание периодической печати активно используются при научных изысканиях. При таком интенсивном использовании часть коллекции пришла в негодность и не выдается читателям. Кроме того, сказывается естественное старение и разрушение бумаги. Таким образом, существует реальная угроза потери части коллекции, которая, как исторический источник, имеет не только национальное, но и международное значение.
В связи с вышеизложенным, приоритетным направлением в создании электронной библиотеки КГУ (ЭБ КГУ) является создание коллекции электронных документов на основе собрания местной периодической печати конца XIX – начала XX вв.

Bibliographic Database Ratings and White Lists

Tatyana Alekseevna Polilova
640-670
Abstract:

Currently, Russian institutions are almost completely disconnected from Western information resources and services related to the publication of scientific journals. In such conditions, the task of replacing the departed services, reorientation to domestic scientific journals, Russian online library resources has become particularly actual. In the largest bibliographic database the eLibrary.ru, focused on Russian-language scientific publications, collected information about almost 15 thousand Russian-language journals. In the eLibrary.ru there is an analytical system "Russian Science Citation Index" that processes metadata of articles from more than 5 thousand Russian scientific journals. Is the eLibrary.ru ready to serve as a national bibliographic database? For what reason "white lists" of journals appear in Russian organizations?


The main problem of the RSCI is the quality of the constructed ratings of scientific journals. The methods of calculating ratings over the past years have caused certain criticisms. The paper provides an example of a rating of journals from the section "Mathematics" built in the RSCI. Journals that are little known among professional mathematicians were in the first positions. Serious deformations in the ratings of the eLibrary.ru undermine the confidence of scientists in the assessments of the credibility of Russian journals proposed by the eLibrary.ru. The reaction of some universities and scientific organizations is quite expected: organizations are beginning to introduce their own criteria for the success of the publication activities of employees associated with the publication of articles in journals from the so-called "white lists". The white list of journals is compiled, as a rule, by the expert councils of the organization specifically for each discipline and scientific direction. Scientometric indicators may be taken into account when compiling white lists, but they are not the primary criterion for the selection of journals. White lists can now become a reasonable addition to the ratings of bibliographic databases.

Keywords: scientific publication, rating of journals, thematic classification, impact factor, multidisciplinary, bibliographic reference, white list of scientific journals.

Semantic library as a tool of defining a scientific subject area

Olga Muratovna Ataeva, Vladimir Alekseevich Serebriakov
988-1005
Abstract:

The paper considers an information system designed to represent a subject area related to science and its features. Highlighted general concepts for formal descriptions of such a subject area in the knowledge base of the semantic library. The peculiarity of these areas is that the data structure is subject to frequent changes. Therefore, the means of organizing knowledge, which is a semantic library, should be sufficiently universal and not require deep technical knowledge. The paper describes the functionality of the system and its use. For each area, the set of resources can differ both in format and in the set of the resources themselves. The set of concepts that form the description of the library's content should be so universal that it can be adapted to the needs of a particular area. Three levels of metadata are used to represent the data.

Keywords: semantic library, ontology, knowledge representation.

Метаданные ИСИР: определение и использование

А.Н. Бездушный, А.М. Меденников, А.М. Серебряков, А.А. Филиппова, А.С. Лопатенко

Preprint as the Material for an Overlay Journal

Tatyana Alekseevna Polilova
387-407
Abstract:

The Open access movement has a long history. In 2002 the Budapest Open access initiative was first announced. However, the problem of Open access has not yet been fully and definitively resolved. In 2018 The European Union has adopted Plan S, which calls for making Open access a reality by 2020. Plan S emphasizes the importance of self-archiving of articles and the role of Preprint’s archives (servers) for scientific results placement. It is noted that Preprint archives have a great potential for editorial and publishing innovations. Scientific journals with limited reader access that operate on a commercial basis do not give up their positions. But even here we see some progress. Journals have become less rigid in their policy towards preprints and post-prints.


More and more foreign scientists are becoming adherents of the "Fair open access" movement, which offers a new organizational solution. The journal must have a scientific organization or non-profit Foundation as a founder, that hires a group of executors to provide editorial and publishing services. Editors and publishers should not have their own commercial interests. The scientific journal should be funded from the general contribution of organizations.


The article considers a modern type of online scientific journal — the overlay journal. The cost of an issue of the overlay journal is so low that the journal can easily implements the "free for the author, free for the reader" scheme. The overlay journal is based on the public servers of preprints. The online overlay journal reviews the article received from the archive. If the article is accepted for publication, the article metadata is published on the journal website, and the full text of corrected article is re-archived. This way of working does not overload the archive functionality, but it allows to reduce the financial burden on the overlay journal.

Keywords: scientific journal, Fair open access, Open archive, server of preprints, overlay journal.

Метаданные и первые результаты каталогизации Интернет

М.Е. Шварцман

Some Aspects of the Formation and Representation Prnciple of Interdisciplinary Collection in the Digital Space of Scientific Knowledge

Sergey Aleksandrovich Kirillov, Irina Nikolaevna Sobolevskaya, Aleksandr Nikolaevich Sotnikov
294-314
Abstract:

Interdisciplinary thematic projects implemented by means of the electronic library "Scientific heritage of Russia" allow integrating objects of various nature (printed publications, archival documents, multimedia objects) into a single thematic resource and making it accessible to users. The approaches to the formation of interdisciplinary thematic collections in the digital space of scientific knowledge are investigated. Algorithms for the formation and presentation of a digital interdisciplinary collection are presented. The method of creation and presentation of virtual collections in the information environment of the electronic library "Scientific heritage of Russia". The main types of sections present in most projects are indicated. The main stages of the formation of an interdisciplinary collection in the digital space of knowledge have been formed and described, including the composition of the collection sections, sources for presenting collection materials, dispatching work with sources, the formation of metadata, the main types of sections, etc. An example of the application of the content formation methodology for creating an interdisciplinary collection is given.

Keywords: virtual exhibition , e-library, scientific heritage, databases, electronic records, digital copies.

Вопросы интеграции управления идентификацией пользователей сетевых, вычислительных и информационных сервисов

А.В. Созыкин, Г.Ф. Масич, А.Г. Масич, А.Н. Бездушный
Abstract: В статье рассматривается подход к управлению идентификацией пользователей корпоративной сети учреждений РАН. Управление идентификацией представляет собой процесс, охватывающий весь жизненный цикл учетных записей пользователей и контроль над правами доступа в распределенных средах. Рассматриваемый подох позволяет интегрировать управление идентификацией пользователей сетевых, вычислительных и информационных сервисов. Архитектура предлагаемого решения основывается на открытых стандартах, использовании многоуровневой компонентной архитектуры LDAP-систем (OpenLDAP, iPlanet Directory) и доступа к ним как через локальные, так и через глобальные сети. Эти работы поддержаны грантами РФФИ 03-07-90140в, 04-07-96003, 02-07-90305ск.
В качестве основного вида хранилищ конфигурационной информации сетевых сервисов сейчас обычно рассматриваются LDAP каталоги. Во множестве конфигурационной информации сетевых сервисов выделяются данные, представляющие интерес для информационно-справочных сервисов – так называемые «метаданные сетевых сервисов».
Поскольку LDAP каталоги часто используются в качестве корпоративных справочников общего назначения, чтобы осуществить принципы «единой точки доступа», «согласованной модификации данных» как информационной, так и сетевой и вычислительной инфраструктур научного учреждения, проводится сравнение схем метаданных информационно-справочных системы ИСИР и стандартных схем LDAP каталогов.
Кроме того, уделяется внимание и такому варианту использования LDAP каталогов как репозитория хранимых объектов информационно-справочных сервисов. Предлагается способы отображения RDFS схемы данных в LDAP схемы.
1 - 13 of 13 items
Information
  • For Readers
  • For Authors
  • For Librarians
Make a Submission
Current Issue
  • Atom logo
  • RSS2 logo
  • RSS1 logo

Russian Digital Libraries Journal

ISSN 1562-5419

Information

  • About the Journal
  • Aims and Scopes
  • Themes
  • Author Guidelines
  • Submissions
  • Privacy Statement
  • Contact
  • eLIBRARY.RU
  • dblp computer science bibliography

Send a manuscript

Authors need to register with the journal prior to submitting or, if already registered, can simply log in and begin the five-step process.

Make a Submission
About this Publishing System

© 2015-2025 Kazan Federal University; Institute of the Information Society