• Main Navigation
  • Main Content
  • Sidebar

Russian Digital Libraries Journal

  • Home
  • About
    • About the Journal
    • Aims and Scopes
    • Themes
    • Editor-in-Chief
    • Editorial Team
    • Submissions
    • Open Access Statement
    • Privacy Statement
    • Contact
  • Current
  • Archives
  • Register
  • Login
  • Search
Published since 1998
ISSN 1562-5419
16+
Language
  • Русский
  • English

Search

Advanced filters

Search Results

Automatic Replenishment of Metadata of Digital Publications using Semantic Services of the Internet

Polina Olegovna Gafurova
164-186
Abstract:

The article describes approaches to replenishing metadata of documents in electronic collections of a digital mathematical library. An open resource of the semantic network is used as a replenishment. For this purpose, software tools have been developed to search for the necessary data and include it in a metadata set. A separate block of metadata in a scientific article is formed from the affiliation of the authors presented in the document. Typically, the ownership that occurs in a document does not contain sufficient data to generate a set of metadata. A method has been developed for providing author affiliation metadata, providing an open register of scientific organization identifiers (ROR), as well as means for making connections between ROR and other semantic chains. This method was applied to the collections of articles of the journal “Digital Libraries” for 2021–2022.


The article describes a method for connecting the Lobachevsky digital mathematical library-DML to new electronic collections, and describes a method for transforming metadata into a digital format available for downloading.

Keywords: ROR, Wikidata, digital libraries, affiliation metadata, Lobachevskii-DML.

Preprint as the Material for an Overlay Journal

Tatyana Alekseevna Polilova
387-407
Abstract:

The Open access movement has a long history. In 2002 the Budapest Open access initiative was first announced. However, the problem of Open access has not yet been fully and definitively resolved. In 2018 The European Union has adopted Plan S, which calls for making Open access a reality by 2020. Plan S emphasizes the importance of self-archiving of articles and the role of Preprint’s archives (servers) for scientific results placement. It is noted that Preprint archives have a great potential for editorial and publishing innovations. Scientific journals with limited reader access that operate on a commercial basis do not give up their positions. But even here we see some progress. Journals have become less rigid in their policy towards preprints and post-prints.


More and more foreign scientists are becoming adherents of the "Fair open access" movement, which offers a new organizational solution. The journal must have a scientific organization or non-profit Foundation as a founder, that hires a group of executors to provide editorial and publishing services. Editors and publishers should not have their own commercial interests. The scientific journal should be funded from the general contribution of organizations.


The article considers a modern type of online scientific journal — the overlay journal. The cost of an issue of the overlay journal is so low that the journal can easily implements the "free for the author, free for the reader" scheme. The overlay journal is based on the public servers of preprints. The online overlay journal reviews the article received from the archive. If the article is accepted for publication, the article metadata is published on the journal website, and the full text of corrected article is re-archived. This way of working does not overload the archive functionality, but it allows to reduce the financial burden on the overlay journal.

Keywords: scientific journal, Fair open access, Open archive, server of preprints, overlay journal.

Algorithm for linking translated articles using authorship statistics

Александр Сергеевич Козицын, Сергей Александрович Афонин, Андрей Александрович Зензинов
494-505
Abstract: During the last decades scientometric techniques have been used for research activity stimulation. Number of published articles and number of their citation counts are among the most important scientometric parameters. In an automated environment, when the publications metadata is gathered from various sources, correct linking of original papers with their translations into different languages is extremely important. In the paper we show that the known text similarity measures are inefficient in the context of article linkage problem. We propose a method for semi-automatic article linkage using statistical data on authors publication activities only. This approach may be used for linking articles without training for the language of translation. The method was evaluated on real-world collection of publications metadata of ISTINA information system.
Keywords: bibliographic data, graph analysis, translation, article, statistics, scientometrics, citation, automated systems.

Digital infrastructure of electronic scientific journal: automation of editorial and publishing process and system of services

Миляуша Салахутдиновна Галявиева, Александр Михайлович Елизаров, Евгений Константинович Липачёв
408-465
Abstract:

We investigated the current models of the publication and dissemination of scientific knowledge. We describe the modern information management system of scientific publications and services that determine their functionality.

We discuss the concept of the digital infrastructure of the electronic scientific journal. Under this infrastructure, we understand the complex that combines management software platform of electronic journal and a number of specialized information systems. The software platform realizes the basic operating log management processes. Information systems provide the operation of additional services, taking into account the specifics of the journal subject area.

We present an approach to the organization of the digital infrastructure of the scientific journal based on an open platform Open Journal Systems (OJS). We provide software services that extend the functionality of this platform and considering specificity of the subject area of scientific journals. We have created software modules for automating of electronic scientific journal workflow. These modules are an extension of OJS.

We present a system of services for the automated processing of collections of scientific documents. These services provide verification of document compliance to the accepted rules of formation of collections and their conversion to the established formats; structural analysis of documents and extraction of metadata, as well as their integration into the scientific information space. The system allows to automatically performing a set of operations that cannot be realized for acceptable time with the traditional manual processing of electronic content. It is designed for the large collections of scientific documents.

Algorithms style validation of texts at the article registration stage in the information system of electronic scientific journal, the selection of reviewers, alert and control the timing of reviewing were automated. Information gathering algorithm with dedicated news lines of scientific journals, further analysis and distribution of news by categories and degrees of importance were developed. The algorithm automatically extract bibliographic data from a homogeneous array of publications (in particular, the issues of the scientific journal) and the formation of metadata blocks for export to international information and analytical system were created. Methods integration of OJS platform and international databases of science citation were developed.

We present methods of processing documents containing mathematical formulas: in the collections of documents that contain mathematical formulas, algorithm for the search formulas is developed; basic ideas, approaches and results already obtained by the mathematical knowledge management based on ontology are presented; a method of constructing recommender systems based on mathematical knowledge ontologies described. The method of primary processing automated of scientific article using TеX-notation developed.

The new direction of researches of scientific communications in the environment of Web 2.0 – altmetrics – is considered. We have analyzed the content of the notion «altmetrics», we conducted a comparison of traditional (bibliometric and scientometric) and alternative indicators. We describe the use of world experience informetric services on scientific journals sites. We discussed options for implementing these approaches to create an electronic scientific journal management platform. 

Keywords: publishing systems, advanced models of publication and dissemination of scientific knowledge, the information society, electronic scientific journal, modern information management system of scientific publication, integration of electronic resources.

Development of a System for Searching and Indexing the Content of Audio Recordings

Roman Aleckseevich Klimov, Azat Shavkatovich Yakupov
483-497
Abstract:

The article is devoted to the development of a search and indexing system for audio files using Automatic Speech Recognition (ASR) and Elasticsearch. Current Russian-language audio file transcription systems have been analyzed, and Whisper has been chosen as the best one. An algorithm for optimizing transcription speed using parallelization of file processing processes has been developed, and its effectiveness has been demonstrated. A microservice architecture-based system has been built, capable of indexing audio file content and their metadata for search purposes. The research results show that the proposed approach can be applied to create efficient and flexible systems for searching and analyzing audio information.

Keywords: transcription, indexing, parallelization, microservices, scalability.

Development of the Information System for Registering the Result of Scientific Institution’ Employees Intellectual Activity

Svetlana Aleksandrovna Vlasova, Nikolay Evgenevich Kalenov
770-793
Abstract:

The article describes a Web-system developed by the authors that implements services related to the formation and provision of multifaceted information about the results of scientific activities (publications, copyright certificates and reports at scientific events) of employees of an organization or a group of organizations. The system is focused both on the end user interested in obtaining specific data, and on the administrative staff, who generates reporting materials for the parent organization. The information base of the system contains metadata on the following classes of objects: persons (authors), organizations and their subdivisions; publications at analytical, monographic and summary levels; copyright certificates; scientific events (conferences, symposia, seminars); reports. The system includes two modules – an administrative one (intended for entering and editing data) and a user one, which is a special search engine that searches for information, visualizes it, provides navigation among related resources and exports data. A distinctive feature of the system is the introduced concept of “equivalent” objects. Objects are considered equivalent if they are represented in the system by different metadata, but referring to the same physical entity. Such objects are “persons” corresponding to one author with different spellings of the surname in the bibliographic descriptions of publications; organizations with different variants of names; articles published unchanged in various languages. In accordance with modern requirements for reporting on publications, the system reflects the sources of research funding, as well as the affiliations indicated in the articles for each author.

Keywords: scientific works, scientific activity, automated system, database, management reports, network technologies.

Analysis of the Distribution of Key Terms in Scientific Articles

Svetlana Aleksandrovna Vlasova, Nikolay Evgenievich Kalenov, Irina Nikolaevna Sobolevskaya
35-51
Abstract:

One of the Common Digital Space of Scientific Knowledge (CDSSK) main components are the subject ontologies of individual thematic subspaces, which include the basic concepts related to this scientific area. The constructing subject ontologies task at the initial phase requires the array of key terms formation in a given scientific are with the subsequent establishment of links between them. A similar task is in the encyclopedias formation in terms of the articles (slots) list generating that determines their content. One of the sources for the formation of the key terms array can be the metadata of articles published in the leading scientific journals. Namely, the author's key terms ("keywords" in the terminology of the journals editors) quoted by the article. To make a conclusion about the possibility of using this approach to the subject ontologies formation, it is necessary to conduct the author's key terms array preanalysis, both in terms of real correspondence to the main areas of research in this science branch and in terms of the distribution of the certain terms occurrence frequency. This article presents the results of the occurrence frequency analysis of the author's key terms in Russian and English, carried out on the software processing basis of several thousand articles from leading Russian journals in mathematics, computer science and physics, reflected in the MathNet database. An assessment was made of the distribution of key terms correspondence (as phrases) and individual words to the Bradford's law, and the key terms cores within the thematic direction were identified.

Keywords: digital space of scientific knowledge, subject ontologies, encyclopedia articles, key terms, article metadata, frequency analysis.

Information System for Registering the Result of Scientific Institution Employees’ Intellectual Activity

Svetlana Aleksandrovna Vlasova, Nikolay Evgenevich Kalenov
218-237
Abstract:

The article describes a typical object-oriented WEB-system designed for storing and providing various reference and statistical data on the scientific works of employees of an institution (group of institutions), developed by specialists of the JSCC RAS. The system contains information about publications of employees and reports made by them at scientific conferences, symposiums, and seminars. The system is focused on working with objects belonged to classes connected between each other, such as "author", "organization", "publication", "report", "event". The metadata profile of objects of each class includes attributes that are necessary to get detailed information about both an individual object of this class and a group of objects associated with the specified attribute values of objects of other classes. For example, you have to get a list of articles by employees of a given organization published articles in a given journal for a given period of time. A distinctive feature of the system is the introduced concept of "equivalent" objects. Such objects are "persons" corresponding to the same author with different spellings of the last name in the bibliographic descriptions of publications; organizations with different versions of names; articles which are published without changes in different languages. This article describes in detail the features of the system, its user interface, and provides examples of performing specific queries.

Keywords: databases, research results accounting, WEB-based system, network technologies, publication activity analysis, software.

Scientific Publications and the Embedding Space of Knowledge

Andreas Khachaturovich Marinosyan, Sergey Georgievich Grigoriev
565-594
Abstract:

The article examines current challenges in scientometrics arising from the surge in publication activity and the widespread adoption of generative artificial intelligence. The existing scientometric toolkit for analyzing research activity is reviewed, categorized into quantitative metrics and science mapping methods (citation network analysis, academic genealogy, semantic analysis, etc.). An attempt is made to overcome the limitations of traditional citation analysis, such as “semantic blindness” and vulnerability to manipulation. As a potential solution, a conceptual model is proposed where the unit of analysis shifts from the publication as a whole to an individual “key statement”. This approach involves recording not only the statement’s content but also its type, area of relevance, and its logical relationship with other claims (confirmation, refutation, clarification, generalization, etc.). Within this framework, principles for calculating modified scientometric metrics are introduced.


The proposed model was tested on a corpus of 728 articles from the Russian  journal Informatics and Education (2016–2025). An analysis conducted using large language models revealed that retrospective extraction of statements faces significant hurdles due to established cultures of scientific communication. Consequently, the study highlights the advantages of having authors formulate key statements themselves as a distinct type of metadata. In conclusion, the paper outlines development paths for the concept of an “embedding space of knowledge,” which could eventually complement existing approaches to analyzing the evolution of scientific ideas and theories.

Keywords: scientometrics, academic genealogy, citation analysis, semantic analysis, large language models, science map, h-index, nanopublications.

Digital Repository "Geologyscience.Ru": Open Access To Scientific Publications On Russian Geology

Michail Ivanovich Patuk, Vera Viktorovna Naumova, Vitaliy Sergeevich Eremenko
1324-1338
Abstract:

The article describes new approaches related to the collection of data from heterogeneous information systems of access to scientific publications using open international standards and protocols for the formation of systems of open access to scientific geological publications. Based on developed and adapted approaches and technological solutions, a set of programs of information and analytical system of access to scientific publications has been implemented, implementing functions of collection, search, cataloguing, filtering and management of scientific publications and their metadata.

Keywords: information technology, Earth sciences, repository, scientific publications.

Web application development based on technologies, resources and services of the Geoportal of the Institute of Computational Modelling SB RAS

О.Э. Якубайлик, А.А. Кадочников, А.В. Токарев
Abstract: The geoportal is a mapping web site; it can be described as specialized software and technologies for spatial data processing. Geoportal's main task is to provide the user with the tools and services of storing and cataloguing, publications and download the spatial (geographic) data, search and filter by metadata, interactive web visualization, direct access to geodata based web mapping services. Geoportal developed in ICM SB RAS with appropriate set of its components and services, has become a GIS platform for creating a number of applied GIS web applications. The article deals with the experience of design and development of these systems.
Keywords: spatial data processing, geodata, web mapping services, geoportal, GIS web applications.
1 - 11 of 11 items
Information
  • For Readers
  • For Authors
  • For Librarians
Make a Submission
Current Issue
  • Atom logo
  • RSS2 logo
  • RSS1 logo

Russian Digital Libraries Journal

ISSN 1562-5419

Information

  • About the Journal
  • Aims and Scopes
  • Themes
  • Author Guidelines
  • Submissions
  • Privacy Statement
  • Contact
  • eLIBRARY.RU
  • dblp computer science bibliography

Send a manuscript

Authors need to register with the journal prior to submitting or, if already registered, can simply log in and begin the five-step process.

Make a Submission
About this Publishing System

© 2015-2026 Kazan Federal University; Institute of the Information Society