●One of the difficulties in building an SQL-like query lange for the Web is the absence of a database_(71)_for this huge, heterogeneous repository of information. However, if we are interested in HTML documents only, we can construct a virtual schema fro
<尚大教育,教育至上,人才为大:sdedu.cc>
●One of the difficulties in building an SQL-like query lange for the Web is the absence of a database_(71)_for this huge, heterogeneous repository of information. However, if we are interested in HTML documents only, we can construct a virtual schema from the implicit structure of these files. Thus, at the highest level of _(72)_,every such document is identified by its Uniform Resource Locator(URL), has a_(73)_ and atext. Als, Wed servers provide som additional information sucn as the type, length, and the last modification date of a document. So, for data mining purposes, we can consider the ste of all HTML documents as arelation:
Documen (url, title, text, type, length, modify )
Where all the _(74)_ are character strings. In this framework, anindividual document is identified with a_(75)_ in this relation. Of course, if some optional information is missing from the HTML document, the associate fields will de left blank, but this is not uncommon in any database.
(71)A.schema B.platform C.module D.relation
(72)A.protocol B.control C.abstracdon D.presentation
(73)A.table B.title C.dhver D.event
(74)A.type B.1inks C.charactehsdcs D.atthbutes
(75)A.relation B.field C.schpt D.tuple
<尚大教育,教育至上,人才为大:sdedu.cc>