Prospects for Applying the Concept of the Semantic Web Analysis for Existing sites

Одним з найважливіших напрямків підвищення ефективності пошуку інформації в Інтернеті є активне застосування концепції семантичного веб. В статті проведено аналіз перспектив використання концепції семантичного веб для аналізу текстових даних в мережі Інтернет. Розглянуто проблеми її застосування до...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Datum:2014
1. Verfasser: Zosimov, V.V.
Format: Artikel
Sprache:English
Veröffentlicht: Міжнародний науково-навчальний центр інформаційних технологій і систем НАН та МОН України 2014
Schriftenreihe:Індуктивне моделювання складних систем
Online Zugang:http://dspace.nbuv.gov.ua/handle/123456789/83992
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Назва журналу:Digital Library of Periodicals of National Academy of Sciences of Ukraine
Zitieren:Prospects for Applying the Concept of the Semantic Web Analysis for Existing sites / V.V. Zosimov // Індуктивне моделювання складних систем: Зб. наук. пр. — К.: МННЦ ІТС НАН та МОН України, 2014. — Вип. 6. — С. 41-46. — Бібліогр.: 8 назв. — англ.

Institution

Digital Library of Periodicals of National Academy of Sciences of Ukraine
id irk-123456789-83992
record_format dspace
spelling irk-123456789-839922015-07-03T03:01:51Z Prospects for Applying the Concept of the Semantic Web Analysis for Existing sites Zosimov, V.V. Одним з найважливіших напрямків підвищення ефективності пошуку інформації в Інтернеті є активне застосування концепції семантичного веб. В статті проведено аналіз перспектив використання концепції семантичного веб для аналізу текстових даних в мережі Інтернет. Розглянуто проблеми її застосування до вже створених сайтів та шляхи їх вирішення. One of the most important ways of increasing information retrieval efficiency on the Internet is the active application of semantic web concept. The article analyzes the prospects of usage the semantic web concept for text data analysis on the Internet. The problems of its application to the already established sites and ways to solve them. Одним из важнейших направлений повышения эффективности поиска информации в Интернете является активное применение концепции семантического веб. В статье проведен анализ перспектив использования концепции семантического веба для анализа текстовых данных в сети Интернет. Рассмотрены проблемы его использования к уже созданным сайтам и пути их решения. 2014 Article Prospects for Applying the Concept of the Semantic Web Analysis for Existing sites / V.V. Zosimov // Індуктивне моделювання складних систем: Зб. наук. пр. — К.: МННЦ ІТС НАН та МОН України, 2014. — Вип. 6. — С. 41-46. — Бібліогр.: 8 назв. — англ. XXXX-0044 http://dspace.nbuv.gov.ua/handle/123456789/83992 004.9 en Індуктивне моделювання складних систем Міжнародний науково-навчальний центр інформаційних технологій і систем НАН та МОН України
institution Digital Library of Periodicals of National Academy of Sciences of Ukraine
collection DSpace DC
language English
description Одним з найважливіших напрямків підвищення ефективності пошуку інформації в Інтернеті є активне застосування концепції семантичного веб. В статті проведено аналіз перспектив використання концепції семантичного веб для аналізу текстових даних в мережі Інтернет. Розглянуто проблеми її застосування до вже створених сайтів та шляхи їх вирішення.
format Article
author Zosimov, V.V.
spellingShingle Zosimov, V.V.
Prospects for Applying the Concept of the Semantic Web Analysis for Existing sites
Індуктивне моделювання складних систем
author_facet Zosimov, V.V.
author_sort Zosimov, V.V.
title Prospects for Applying the Concept of the Semantic Web Analysis for Existing sites
title_short Prospects for Applying the Concept of the Semantic Web Analysis for Existing sites
title_full Prospects for Applying the Concept of the Semantic Web Analysis for Existing sites
title_fullStr Prospects for Applying the Concept of the Semantic Web Analysis for Existing sites
title_full_unstemmed Prospects for Applying the Concept of the Semantic Web Analysis for Existing sites
title_sort prospects for applying the concept of the semantic web analysis for existing sites
publisher Міжнародний науково-навчальний центр інформаційних технологій і систем НАН та МОН України
publishDate 2014
url http://dspace.nbuv.gov.ua/handle/123456789/83992
citation_txt Prospects for Applying the Concept of the Semantic Web Analysis for Existing sites / V.V. Zosimov // Індуктивне моделювання складних систем: Зб. наук. пр. — К.: МННЦ ІТС НАН та МОН України, 2014. — Вип. 6. — С. 41-46. — Бібліогр.: 8 назв. — англ.
series Індуктивне моделювання складних систем
work_keys_str_mv AT zosimovvv prospectsforapplyingtheconceptofthesemanticwebanalysisforexistingsites
first_indexed 2025-07-06T10:53:09Z
last_indexed 2025-07-06T10:53:09Z
_version_ 1836894587532083200
fulltext V. V. Zosimov Індуктивне моделювання складних систем, випуск 6, 2014 41 УДК 004.9 PROSPECTS FOR APPLYING THE CONCEPT OF THE SEMANTIC WEB ANALYSIS FOR EXISTING SITES V.V. Zosimov Mykolaiv V.O. Suhomlynsky National University zosimovvv@bk.ru Одним з найважливіших напрямків підвищення ефективності пошуку інформації в Інтернеті є активне застосування концепції семантичного веб. В статті проведено аналіз перспектив використання концепції семантичного веб для аналізу текстових даних в мережі Інтернет. Розглянуто проблеми її застосування до вже створених сайтів та шляхи їх вирішення. Ключові слова: семантика, Інтернет, пошук інформації, аналіз даних, сайт, пошуковий агент, онтологія, обробка та зберігання інформації, структури даних.. One of the most important ways of increasing information retrieval efficiency on the Internet is the active application of semantic web concept. The article analyzes the prospects of usage the semantic web concept for text data analysis on the Internet. The problems of its application to the already established sites and ways to solve them. Keywords: semantics, Internet, information retrieval, data analysis, site search agent, ontology, information processing and storage, data structures. Одним из важнейших направлений повышения эффективности поиска информации в Интернете является активное применение концепции семантического веб. В статье проведен анализ перспектив использования концепции семантического веб для анализа текстовых данных в сети Интернет. Рассмотрены проблемы ее применения к уже созданным сайтам и пути их решения. Ключевые слова: семантика, Интернет, поиск информации, анализ данных, сайт, поисковый агент, онтология, обработка и хранение информации, структуры данных. Introduction. Modern Internet is dynamically developing. In early 2012, there were 330 million sites. By the end of the year its number has increased more than 2 times and reached 743 million. Number of working sites in January 2014 amounted to 861.4 million. Total number of sites currently stands at more than a billion. [1] (Fig 1,2) Sites number rapid growth has led to the fact that the developers of search engines are faced with several problems. "Dimension" problem one of the principals. Standard information search and processing methods that used by modern search engines are losing their effectiveness with such a large amount of the required documents. User receives millions of web pages as a response for search request. Search results priority order is determined by search engines ranking algorithms which do not consider any user's preferences, or context in which the search query is consume. According to Internet live statistics [2] 31.9% of users are browsing only first results page, and then follow to one of the sites. 23%, are browsing two pages before make a choice. 16.1% browse only first three links, and do not look at the other results. Prospects for Applying the Concept of the Semantic Web Analysis for Existing Sites Індуктивне моделювання складних систем, випуск 6, 2014 42 These data show that almost half (48%) of Internet users are following link from the first page and do not browse even the second. Respectively, they browse maximum ten results from hundreds thousands or even millions search results found by entered keywords. If user does not find information on the first page, he changes the search query. And there is no guarantee that the first page contains information which is more complete and meaning appropriate to the user's needs. About half (45.9%) of the respondents admitted that almost immediately found right products and services. Thirds of respondents (33.3%) can not find required data in three cases out of four. 13.3% of users receive necessary information only in half of the cases. Finally, 5.1% are right only a quarter of cases and 2.3% do not understand how is it possible to find something using search engines. In consequence of this there is an urgent need to develop new information retrieval models, which can choose the most appropriate documents within the meaning of user's needs among found by keywords. Fig 1. Live Internet statistic [3] Tim Berners Lee proposed concept of the Semantic Web as a new stage in the Internet development, which would allow the machines easier understand and process contained in the website information due to the of semantic site markup. [2] According to the developers, this approach will help to eliminate the main drawback of modern research - the "dimension" problem, as well as to make information retrieval on the Internet more comfortable for the user. A number of search engines based on search agents were developed under this concept. These systems give the user results relevant not only to their keywords, but taking into account pages semantic content and additional search parameters input by the user. V. V. Zosimov Індуктивне моделювання складних систем, випуск 6, 2014 43 Fig 2. Web sites number growing schedule [3] 1. Semantic web concept Semantic Web - is a public global semantic network that is formed on the World Wide Web basis through the information standardization in the machine processing suitable form.[4] In a typical World Wide Web based on HTML-pages, the information lies in the text pages and is intended for human reading and understanding. The Semantic Web consists of machine-readable elements - semantic network nodes, buttress on ontology. Due to this, programs-clients can directly get statements "object - type relationship - the other object" and determine logical conclusions for them. Semantic Web works in parallel with the conventional World Wide Web and on its basis, using the HTTP protocol and resource identifiers URI. [5] Semantic Web is a symbiosis of the two directions, the first of which covers data representation languages. The main languages are Extensible Markup Language XML [6] and Resource Description Framework RDF [7]. There are also a number of other formats, but XML and RDF offer more opportunities, so they are recommended by W3C The second, conceptual direction carries a theoretical understanding of the domain model. Such models of domains in terms of the Semantic Web are called ontologies. February 10, 2004, the W3C adopted and published a web ontology language OWL [8] specification. Prospects for Applying the Concept of the Semantic Web Analysis for Existing Sites Індуктивне моделювання складних систем, випуск 6, 2014 44 Fig 3. Semantic Web concepts Stack The Semantic Web project involves the creation of system with "artificial intelligence" elements, which would allow special applications to look for high quality relevant information, as well as share information to each other over the Internet. At the same time the ontology language OWL is a decisive component of intellectualization, the basis for the semantic networks construction. It should be noted that the semantic networks theory has been focused on the artificial intelligence problems, such as machine translation systems. Knowledge in the semantic networks theory were represented as nodes connected by arcs, each of which specifies the relationship type. The Semantic Web is essentially a realization of the artificial intelligence idea, but the term is not very popular because of the large number of failed projects in this area, so the "semantic web" concept is today alertness. However, web ontology essentially a real knowledge base, one of the artificial intelligence conceptual foundations. In the future, the correct website execution will allow interacting services with based on the semantic relationships analysis between concepts and objects available in the network, provide the user information that will satisfy his needs 2. Applying the Semantic Web concept to existing sites Despite the active development of the Semantic Web concept, there are number of problems that do not allow it to be used to implement a wide range of users: 1. The Semantic Web concept assumes that the data analysis and the most optimal results selection will make machine automatically. Search agents show the V. V. Zosimov Індуктивне моделювання складних систем, випуск 6, 2014 45 person a few most relevant results on the basis of semantic analysis. And if the search agents algorithms are not determined for users preferences, then he can not find appropriate results and other results, among which there is one that would satisfy users preferences will be filtered by machine. Thus, rather than to facilitate search task for the user application of this concept severely restricts his choice. This brings us back to the common search engines problem. Search agents usually have the opportunity to extended search settings, but not all users are willing to spend time learning new interfaces. Therefore it is necessary to develop the most simple and user-friendly interfaces of user interaction with the search agent. Otherwise, a long and complex setting search parameters may scare the user. 2. Semantic Web concept imposes special requirements on sites development standards. But most developers do not follow all the sites requirements for web pages creation. This happens for several reasons: 1) Lack of time to develop the project, the time limits set by the customer do not allow make high-quality semantic markup. 2) For today a very small percentage of Internet users are searching information using search agents. The bulk of the users prefer simple and familiar search tools. So sites customers do not see the point in spending time and money on semantic markup. 3) The company, which provides services do not make site semantic markup due to unprofitability or developers unwillingness to learn new technology. 3. The most significant drawback is the absence of effective methods and technologies for automatic semantic annotation of existing web pages. Considering the Internet growth rate, the number of active sites up to date is more than one billion. Bring all these web resources to the semantic web standards by hand without the use of automatic machine methods is an impossible task. User deliberately narrows the search to about 5% of all contained on the web pages using semantic search. If websites without semantic markup contains user necessary information he does not get access to it using a semantic search. It follows that the Semantic Web concept can not properly function without the development automatic semantic annotation methods for existing web pages. Development of such methods will allow the concept of the semantic web to cover a large part of information resources located on the Internet, and continue to develop into a complete, user-friendly and widely used tool of web data mining and information retrieval. Conclusions. Semantic web concept became an opportunity for the Internet to avoid the crisis of large dimension. But despite all the advantages of the concept it has is one major drawback. A huge number of existing sites created without semantic markup can not fully participate in the intellectual retrieval. Huge number of newly created sites are also done without usage of new technologies. Therefore, semantic search will only work with the newly created site made using all semantic web rules. This suggests need to search for new tools that enable the automatic semantic markup of existing sites and to simplify the creation of semantic markup for new sites. Prospects for Applying the Concept of the Semantic Web Analysis for Existing Sites Індуктивне моделювання складних систем, випуск 6, 2014 46 Література 1. http://www.netcraft.com/ - report for the number of working sites on the Internet. 2. http://iprospect.com - statistics service for the viewed by user search results pages 3. http://internetlivestats.com/ - real time statistics service for the Internet services status. 4. http://wikipedia.org/ - the semantic web definition, architecture, implementation problems 5. http://www.w3.org/ - description of the semantic web concept, syntax, data definition language. 6. www.w3.org/TR/NOTE-xml-ql/ - query Language XML (Extensible Markup Language) 7. http://www.w3.org/TR/rdf-tuturial - Manual for the RDF (Resource Description Framework). 8. http://www.w3.org/TR/owl-ref/ - Manual for the OWL (web ontology language).