文章的正确链接:
Maltseva N.K., Popova V.O., Syrov A.A. —
// 软件系统和计算方法.
– 2022. – № 1.
– 和。 1 - 8.
DOI: 10.7256/2454-0714.2022.1.19323 URL: https://cn.nbpublish.com/library_read_article.php?id=19323
文章的正确链接:
Kiryanov D.A. —
// 软件系统和计算方法.
– 2022. – № 1.
– 和。 9 - 31.
DOI: 10.7256/2454-0714.2022.1.37341 URL: https://cn.nbpublish.com/library_read_article.php?id=37341
注释,注释:
The subject of this research is the key methods for creating the architecture of information aggregators, methods for increasing scalability and effectiveness of such systems, methods for reducing the delay between the publication of new content by the source and emergence of its copy in the information aggregator. In this research, the content aggregator implies the distributed high-load information system that automatically collects information from various sources, process and displays it on a special website or mobile application. Particular attention is given to the basic principles of content aggregation: key stages of aggregation and criteria for data sampling, automation of aggregation processes, content copy strategies, and content aggregation approaches. The author's contribution consists in providing detailed description of web crawling and fuzzy duplicate detection systems. The main research result lies in the development of high-level architecture of the content aggregation system. Recommendations are given on the selection of the architecture of styles and special software regime that allows creating the systems for managing distributed databases and message brokers. The presented architecture aims to provide high availability, scalability for high query volumes, and big data performance. To increase the performance of the proposed system, various caching methods, load balancers, and message queues should be actively used. For storage of the content aggregation system, replication and partitioning must be used to improve availability, latency, and scalability. In terms of architectural styles, microservice architecture, event-driven architecture, and service-based architecture are the most preferred architectural approaches for such system.
关键词:
Summarization, Data sampling criteria, Content aggregation stages, Fuzzy duplicates detection, Web crawling, Distributed system architecture, Content aggregation, High load system, Poisson process, Microservices architecture
文章的正确链接:
Pekunov V.V. —
// 软件系统和计算方法.
– 2022. – № 1.
– 和。 32 - 41.
DOI: 10.7256/2454-0714.2022.1.37240 URL: https://cn.nbpublish.com/library_read_article.php?id=37240
文章的正确链接:
Borevich E.V., Yanchus V.E. —
// 软件系统和计算方法.
– 2022. – № 1.
– 和。 42 - 54.
DOI: 10.7256/2454-0714.2022.1.37730 URL: https://cn.nbpublish.com/library_read_article.php?id=37730
文章的正确链接:
Soldatov A.S., Maslov S.V., Kukushkin Y.A. —
// 软件系统和计算方法.
– 2022. – № 1.
– 和。 55 - 70.
DOI: 10.7256/2454-0714.2022.1.24631 URL: https://cn.nbpublish.com/library_read_article.php?id=24631
文章的正确链接:
Zorchenkov A.M. —
// 软件系统和计算方法.
– 2022. – № 1.
– 和。 71 - 87.
DOI: 10.7256/2454-0714.2022.1.34337 URL: https://cn.nbpublish.com/library_read_article.php?id=34337