Настоящий номер журнала «Электронные библиотеки» является второй частью тематического выпуска и включает статьи, подготовленные их авторами на основе материалов, представленных ими в 2020 году на XXII Всероссийской научной конференции «Научный сервис в сети Интернет».

 

Эта конференция была проведена с 21 по 25 сентября 2020 года и традиционно была посвящена направлениям и тенденциям использования интернет-технологий в современных научных исследованиях. Основная цель конференции — предоставить возможность для обсуждения, апробации и обмена мнениями о наиболее значимых результатах, полученных ведущими российскими учеными за последнее время в данной области деятельности. Организатором конференции был Институт прикладной математики им. М.В. Келдыша Российской академии наук. В связи со сложившейся эпидемической обстановкой конференция была проведена в режиме онлайн.

 

Первая часть тематического выпуска размещена в №1 журнала «Электронные библиотеки» за 2021 год, вторая часть – в настоящем номере.

 

М. М. Горбунов-Посадов, А. М. Елизаров

Published: 28.04.2021

Authors Identification within the Subject Area in the Semantic Library

Olga Muratovna Ataeva, Vladimir Alekseevich Serebriakov, Natalia Pavlovna Tuchkova
198-217
Abstract:

The peculiarities of the task of authors identifying and determining author's contribution to publications in digital bibliographic codes are considered. The features of the problem of insufficient identification are manifested in the repetition of information, doubling, the presence of authors with completely coincidental names, self-quotation, autoplagiate and plagiarism itself. It is proposed to use publication information that has already been accumulated in the digital library in the form of related object area data and a variety of target thesaurus data, as the author and user of the library. This information contains links whereby keyword contexts, multiple co-authors, and term associations in dictionaries and thesauruses can be used to identify authorship. It is important that an array of scientific publications is considered, since they have an established traditional structure, which allows comparing fixed text elements (annotations, keywords, classifier codes, etc.). Thus, even if the names in the publications are fully matched, the question of authorship can be raised if the publications in the digital library correspond to different subject areas. Resolution of such contradictions is accomplished by evaluating a plurality of links of all elements of secondary publication information. The result of the comparison could be the addition of the author to a specific area, i.e. the extension of the addressee's thesaurus and the author's personal thesaurus, or the appearance of full namesakes in the library, but from different areas of knowledge. It has been shown that modern data analysis tools allow you to evaluate the author's contribution to publication, despite the fact that of course, only the scientific community can evaluate the real contribution to scientific research.

Information System for Registering the Result of Scientific Institution Employees’ Intellectual Activity

Svetlana Aleksandrovna Vlasova, Nikolay Evgenevich Kalenov
218-237
Abstract:

The article describes a typical object-oriented WEB-system designed for storing and providing various reference and statistical data on the scientific works of employees of an institution (group of institutions), developed by specialists of the JSCC RAS. The system contains information about publications of employees and reports made by them at scientific conferences, symposiums, and seminars. The system is focused on working with objects belonged to classes connected between each other, such as "author", "organization", "publication", "report", "event". The metadata profile of objects of each class includes attributes that are necessary to get detailed information about both an individual object of this class and a group of objects associated with the specified attribute values of objects of other classes. For example, you have to get a list of articles by employees of a given organization published articles in a given journal for a given period of time. A distinctive feature of the system is the introduced concept of "equivalent" objects. Such objects are "persons" corresponding to the same author with different spellings of the last name in the bibliographic descriptions of publications; organizations with different versions of names; articles which are published without changes in different languages. This article describes in detail the features of the system, its user interface, and provides examples of performing specific queries.

Algorithms for Formation of Metadata Mathematical Retro Collections Based on Analysis of Structural Features of Documents

Polina Olegovna Gafurova, Alexander Michailovich Elizarov, Evgeny Konstantinovich Lipachev
238-271
Abstract:

The solutions of the main problems associated with the formation of digital mathematical collections from documents published in the pre-digital period are presented – such collections are designated in the work as retro collections. Algorithms for creating a meta description of retro collections based on the analysis of the structure of mathematical documents and the use of software tools for extracting metadata are given. The description of retro-collections formed using the developed algorithms and included in the metadata factory of the digital mathematical library Lobachevskii-DML is given. The schemes for the formation of metadata and methods for normalizing the extracted metadata in accordance with the schemes and requirements of the integrating mathematical libraries are indicated.

Applying Machine Learning to the Task of Generating Search Queries

Alexander Michailovich Gusenkov, Alina Rafisovna Sittikova
272-293
Abstract:

In this paper we research two modifications of recurrent neural networks – Long Short-Term Memory networks and networks with Gated Recurrent Unit with the addition of an attention mechanism to both networks, as well as the Transformer model in the task of generating queries to search engines. GPT-2 by OpenAI was used as the Transformer, which was trained on user queries. Latent-semantic analysis was carried out to identify semantic similarities between the corpus of user queries and queries generated by neural networks. The corpus was convert-ed into a bag of words format, the TFIDF model was applied to it, and a singular value decomposition was performed. Semantic similarity was calculated based on the cosine measure. Also, for a more complete evaluation of the applicability of the models to the task, an expert analysis was carried out to assess the coherence of words in artificially created queries.

Some Aspects of the Formation and Representation Prnciple of Interdisciplinary Collection in the Digital Space of Scientific Knowledge

Sergey Aleksandrovich Kirillov, Irina Nikolaevna Sobolevskaya, Aleksandr Nikolaevich Sotnikov
294-314
Abstract:

Interdisciplinary thematic projects implemented by means of the electronic library "Scientific heritage of Russia" allow integrating objects of various nature (printed publications, archival documents, multimedia objects) into a single thematic resource and making it accessible to users. The approaches to the formation of interdisciplinary thematic collections in the digital space of scientific knowledge are investigated. Algorithms for the formation and presentation of a digital interdisciplinary collection are presented. The method of creation and presentation of virtual collections in the information environment of the electronic library "Scientific heritage of Russia". The main types of sections present in most projects are indicated. The main stages of the formation of an interdisciplinary collection in the digital space of knowledge have been formed and described, including the composition of the collection sections, sources for presenting collection materials, dispatching work with sources, the formation of metadata, the main types of sections, etc. An example of the application of the content formation methodology for creating an interdisciplinary collection is given.

The Use of Thematic Analysis Methods in Scientometric Systems

Alexander Sergeevich Kozitsyn, Sergey Alexandrovich Afonin, Dmitry Alekseevich Shachnev
315-338
Abstract:

Modern scientometric systems and citation systems use various mechanisms of thematic search and thematic filtering of information. In most cases, a full-text approach is used for thematic analysis of articles and journals, which has a number of limitations. The use of algorithms based on graph analysis, both independently and in conjunction with full-text algorithms, eliminates these limitations and improves the completeness and accuracy of subject search. The algorithm developed by the authors and presented in this work uses the co-authorship graph to analyze the thematic proximity of journals. The algorithm is insensitive to the language of the journal and selects similar journals in different languages, which is difficult to implement for algorithms based on the analysis of full-text information. The algorithm was tested in the scientometric system IAS ISTINA. In the interface developed for these purposes, the user can select one journal that is close to him on the subject, and the system will automatically generate a selection of journals that may be of interest to the user both in terms of studying the materials available in them and in terms of publishing his own articles. In the future, the developed algorithm can be adapted to search for similar conferences, collections of publications and scientific projects. The presence of such a tool will increase the publication activity of young employees, increase the citation rate of articles and the citation rate between journals. The results of the algorithm for determining thematic proximity between journals, collections, conferences and scientific projects can also be used to build rules in models of differentiating access to data based on domain ontologies.

Research of the Contexts of the Ecosystem of "Digital Tourism"

Olga Vital'evna Kononova , Dmitry Evgenievich Prokudin, Elena Nikolaevna Tupikina
339-370
Abstract:

Modern information and communication technologies, elements of digitalization are constantly and rapidly developing, which, in turn, has a direct impact on all spheres of human activity. In the light of recent events related to the collapse of the tourism business due to COVID-19, there is a great scientific interest in the service sector, namely in the field of "digital tourism". Digital tourism relies on the widespread adoption of new technologies such as social media and mobile technologies, smart devices and sensors to collect and use massive amounts of data to create new value propositions. In this regard, the authors set a goal – to present a review of the literature on "digital tourism" from the standpoint of scientific and media discourse. The authors present a comprehensive scientific approach, including the sequential implementation of all stages of the review, from the definition of the terminological core of the interdisciplinary direction, the formation of search queries, cascade search, selection and content analysis of materials to the identification and explication of contexts. The sources of information for preparing the review were publications from academic databases: Web of Science, Science-Direct, Scopus, GoogleScholar, eLibrary, Cyberleninka, as well as materials and publications in Russian-language media – Integrum.


The results obtained by the authors will be useful for scientists in identifying promising areas of research in the field of "digital tourism", as well as deepen their knowledge of mechanisms for searching, collecting and analyzing data and integrated and analytical environments.

Refutation of a Rumor by the Mass Media: Mathematical Model and Numerical Experiments

Alexander Petrovich Mikhailov, Alexander Petrov
371-386
Abstract:

The process is considered, in which an unreliable rumor spreads in society, which is opposed by the broadcasting of the mass media. In this case, the unreliability of hearing is understood so that the information of the media contains a refutation and thereby inoculates individuals, that is, makes them immune to hearing. At the same time, individuals who have managed to accept the rumor cease to trust the media and thereby become unavailable for persuasion. For this process, a mathematical model is proposed in two versions. The variant with continuous time reveals some of the mathematical properties of the model. The discrete time option is more convenient for analyzing real processes since it allows one to estimate the parameters of the model. To assess these parameters, data on the ratings of the main socio-political programs of Russian TV channels were used. Several scenario calculations of the model with these parameters are presented. The main conclusion is that if the information disseminated by the media is not viral, that is, it is not retold by viewers to their neighbors in society, then the media are unable to resist rumors.

Preprint as the Material for an Overlay Journal

Tatyana Alekseevna Polilova
387-407
Abstract:

The Open access movement has a long history. In 2002 the Budapest Open access initiative was first announced. However, the problem of Open access has not yet been fully and definitively resolved. In 2018 The European Union has adopted Plan S, which calls for making Open access a reality by 2020. Plan S emphasizes the importance of self-archiving of articles and the role of Preprint’s archives (servers) for scientific results placement. It is noted that Preprint archives have a great potential for editorial and publishing innovations. Scientific journals with limited reader access that operate on a commercial basis do not give up their positions. But even here we see some progress. Journals have become less rigid in their policy towards preprints and post-prints.


More and more foreign scientists are becoming adherents of the "Fair open access" movement, which offers a new organizational solution. The journal must have a scientific organization or non-profit Foundation as a founder, that hires a group of executors to provide editorial and publishing services. Editors and publishers should not have their own commercial interests. The scientific journal should be funded from the general contribution of organizations.


The article considers a modern type of online scientific journal — the overlay journal. The cost of an issue of the overlay journal is so low that the journal can easily implements the "free for the author, free for the reader" scheme. The overlay journal is based on the public servers of preprints. The online overlay journal reviews the article received from the archive. If the article is accepted for publication, the article metadata is published on the journal website, and the full text of corrected article is re-archived. This way of working does not overload the archive functionality, but it allows to reduce the financial burden on the overlay journal.

Storyboard as One of the Representations of the Scenario Prototype of Computer Games

Vlada Vladimirovna Kugurakova, Gulnara Faritovna Sahibgareeva , Oleg Aleksandrovich Bedrin
408-444
Abstract:

The work is devoted to the study and improvement of the design, development, and testing of video game storytelling. The existing practices of writing and keeping up-to-date scripts for interactive works have been studied. The definition of a scenario prototype and requirements for its form are formulated. An idea was put forward about the efficiency of automating the creation of a scenario prototype in the form of a generator tool. A vision of such a tool has been drawn up. The impact of such a tool on development order is presented. Implemented a tool component and conducted an experiment that proves its effectiveness with an example such as generating storyboards from the text. Plans for future development have been formulated.