This HTML5 document contains 34 embedded RDF statements represented using HTML+Microdata notation.

The embedded RDF content will be recognized by any processor of HTML5 Microdata.

Namespace Prefixes

Prefix      IRI
dct         http://purl.org/dc/terms/
dbo         http://dbpedia.org/ontology/
foaf        http://xmlns.com/foaf/0.1/
dbt         http://dbpedia.org/resource/Template:
rdfs        http://www.w3.org/2000/01/rdf-schema#
freebase    http://rdf.freebase.com/ns/
rdf         http://www.w3.org/1999/02/22-rdf-syntax-ns#
owl         http://www.w3.org/2002/07/owl#
n12         http://en.wikipedia.org/wiki/
dbc         http://dbpedia.org/resource/Category:
dbp         http://dbpedia.org/property/
prov        http://www.w3.org/ns/prov#
xsdh        http://www.w3.org/2001/XMLSchema#
dbr         http://dbpedia.org/resource/
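
The prefixed names used in the statements on this page are shorthand for full IRIs: the prefix is replaced by its namespace IRI and the local part is appended. A minimal sketch of that expansion follows, assuming a plain dictionary copy of a few rows of the prefix table; the names `PREFIXES` and `expand` are hypothetical and not part of any RDF library.

```python
# Hypothetical helper: a subset of the namespace prefixes listed on this page.
PREFIXES = {
    "dbr": "http://dbpedia.org/resource/",
    "dbc": "http://dbpedia.org/resource/Category:",
    "dbo": "http://dbpedia.org/ontology/",
    "rdfs": "http://www.w3.org/2000/01/rdf-schema#",
    "freebase": "http://rdf.freebase.com/ns/",
}


def expand(name, prefixes=PREFIXES):
    """Expand a prefixed name such as 'dbr:Macbeth' into a full IRI."""
    # Split on the first colon only, so local parts may contain colons.
    prefix, sep, local = name.partition(":")
    if not sep or prefix not in prefixes:
        raise KeyError(f"unknown or missing prefix in {name!r}")
    return prefixes[prefix] + local


print(expand("dbr:Normalized_Google_distance"))
print(expand("dbc:Statistical_distance"))
```

Under these assumptions, the subject `dbr:Normalized_Google_distance` expands to `http://dbpedia.org/resource/Normalized_Google_distance`.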

Statements

Subject Item
dbr:Normalized_Google_distance
rdfs:label
Normalized Google distance
rdfs:comment
The Normalized Google Distance (NGD) is a semantic similarity measure derived from the number of hits returned by the Google search engine for a given set of keywords. Keywords with the same or similar meanings in a natural language sense tend to be "close" in units of Normalized Google Distance, while words with dissimilar meanings tend to be farther apart. Specifically, the Normalized Google Distance (NGD) between two search terms x and y is $\operatorname{NGD}(x, y) = \frac{\max\{\log f(x), \log f(y)\} - \log f(x, y)}{\log N - \min\{\log f(x), \log f(y)\}}$. For example, "Shakespeare" and "Macbeth" are very much alike according to the relative semantics supplied by Google.
owl:sameAs
freebase:m.0fph22r
dbp:wikiPageUsesTemplate
dbt:Doi dbt:Reflist
dct:subject
dbc:Statistical_distance dbc:Computational_linguistics
prov:wasDerivedFrom
n12:Normalized_Google_distance?oldid=1021228071&ns=0
dbo:wikiPageID
29662478
dbo:wikiPageLength
7431
dbo:wikiPageRevisionID
1021228071
dbo:wikiPageWikiLink
dbr:Oxford_English_Dictionary dbc:Computational_linguistics dbr:Semantic_similarity dbr:World_Wide_Web dbr:Symmetry dbr:Google_Search dbr:Set_(abstract_data_type) dbr:Wikipedia dbr:Normalized_compression_distance dbr:Metric_(mathematics) dbr:Triangle dbr:Google dbr:William_Shakespeare dbr:Bible dbr:Support-vector_machine dbr:Macbeth dbr:Prime_number dbr:Similarity_measure dbc:Statistical_distance dbr:WordNet dbr:Index_term
dbo:abstract
The Normalized Google Distance (NGD) is a semantic similarity measure derived from the number of hits returned by the Google search engine for a given set of keywords. Keywords with the same or similar meanings in a natural language sense tend to be "close" in units of Normalized Google Distance, while words with dissimilar meanings tend to be farther apart. Specifically, the Normalized Google Distance (NGD) between two search terms x and y is $\operatorname{NGD}(x, y) = \frac{\max\{\log f(x), \log f(y)\} - \log f(x, y)}{\log N - \min\{\log f(x), \log f(y)\}}$, where N is the total number of web pages searched by Google multiplied by the average number of singleton search terms occurring on pages; f(x) and f(y) are the number of hits for search terms x and y, respectively; and f(x, y) is the number of web pages on which both x and y occur. If $\operatorname{NGD}(x, y) = 0$ then x and y are viewed as alike as possible, but if $\operatorname{NGD}(x, y) \geq 1$ then x and y are very different. If the two search terms x and y never occur together on the same web page, but do occur separately, the NGD between them is infinite. If both terms always occur together, their NGD is zero. Example: On 9 April 2013, googling for "Shakespeare" gave 130,000,000 hits; googling for "Macbeth" gave 26,000,000 hits; and googling for "Shakespeare Macbeth" gave 20,800,000 hits. The number of pages indexed by Google was estimated by the number of hits for the search term "the", which was 25,270,000,000. Assuming there are about 1,000 search terms on the average page, this gives $N = 25{,}270{,}000{,}000{,}000$. Hence $\operatorname{NGD}(\text{Shakespeare}, \text{Macbeth}) \approx 0.13$. "Shakespeare" and "Macbeth" are very much alike according to the relative semantics supplied by Google.
foaf:isPrimaryTopicOf
n12:Normalized_Google_distance
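
The NGD definition and the "Shakespeare"/"Macbeth" example in the dbo:abstract value above can be checked numerically. The following is a minimal sketch, assuming the standard NGD formula with natural logarithms (the ratio does not depend on the logarithm base); the function name `ngd` and its parameter names are illustrative, and the hit counts are the ones quoted in the abstract.

```python
import math


def ngd(fx, fy, fxy, n):
    """Normalized Google Distance from hit counts.

    fx, fy: hits for each term alone; fxy: hits for both terms together;
    n: number of indexed pages times the average number of terms per page.
    """
    if fx <= 0 or fy <= 0:
        raise ValueError("each term must occur at least once")
    if fxy == 0:
        # Terms occur separately but never together: distance is infinite.
        return math.inf
    log_fx, log_fy, log_fxy = math.log(fx), math.log(fy), math.log(fxy)
    return (max(log_fx, log_fy) - log_fxy) / (math.log(n) - min(log_fx, log_fy))


# Figures quoted in the abstract (9 April 2013).
N = 25_270_000_000 * 1_000  # hits for "the" times ~1,000 terms per page
value = ngd(130_000_000, 26_000_000, 20_800_000, N)
print(round(value, 2))  # 0.13
```

A value this close to zero is what the abstract means by the two terms being "very much alike".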