Knowledge graphs: Difference between revisions

    From Consumerium development wiki R&D Wiki
    (+ '''SPARQL endpoint''' at databus.dbpedia.org)
     
    (6 intermediate revisions by the same user not shown)
    Line 2: Line 2:


    See also: [[mw:Manual:Managing data in MediaWiki]] at mediawiki.org
    See also: [[mw:Manual:Managing data in MediaWiki]] at mediawiki.org
    = DBpedia =
    [[File:DBpediaLogo.svg|thumb|right|260px|The [https://wiki.dbpedia.org/ DBpedia] logo]]
    '''[[w:DBpedia|DBpedia]]''' [https://wiki.dbpedia.org/ (.org)] is a community effort to enable the web moving ''"Towards a Public Data Infrastructure for a Large, Multilingual, Semantic Knowledge Graph"''.
    Today the DBpedia '''[[w:data set|data set]]s''' contain a wealth of information structured into '''[[w:Ontology (information science)|ontologies]]'''. This [[w:structured data|structured data]] can be queried with [[SPARQL]] query language at their [http://dbpedia.org/sparql public DBpedia SPARQL endpoint].
    There are many methods how the DBpedia ontology and datasets could be used in the Consumerium implementation stage wiki.
    == Ontology classes useful for implementing Consumerium ==
    '''[http://mappings.dbpedia.org/server/ontology/classes/ All DBpedia ontology classes]'''
    * [http://mappings.dbpedia.org/server/ontology/classes/Company Company]
    * [http://mappings.dbpedia.org/server/ontology/classes/Non-ProfitOrganisation Non-ProfitOrganisation]
    * [http://mappings.dbpedia.org/server/ontology/classes/Food Food]
    ** [http://mappings.dbpedia.org/server/ontology/classes/Beverage Beverage]
    * [http://mappings.dbpedia.org/server/ontology/classes/MeanOfTransportation MeanOfTransportation]
    ** [http://mappings.dbpedia.org/server/ontology/classes/Aircraft Aircraft]
    ** [http://mappings.dbpedia.org/server/ontology/classes/Automobile Automobile]
    ** [http://mappings.dbpedia.org/server/ontology/classes/Locomotive Locomotive]
    ** [http://mappings.dbpedia.org/server/ontology/classes/Ship Ship]
    * [http://mappings.dbpedia.org/server/ontology/classes/Place Place]
    ** [http://mappings.dbpedia.org/server/ontology/classes/ArchitecturalStructure ArchitecturalStructure]
    *** [http://mappings.dbpedia.org/server/ontology/classes/Building Building]
    **** [http://mappings.dbpedia.org/server/ontology/classes/Hotel Hotel]
    **** [http://mappings.dbpedia.org/server/ontology/classes/Restaurant Restaurant]
    **** [http://mappings.dbpedia.org/server/ontology/classes/ShoppingMall ShoppingMall]
    *** [http://mappings.dbpedia.org/server/ontology/classes/Infrastructure Infrastructure]
    **** [http://mappings.dbpedia.org/server/ontology/classes/Airport Airport]
    **** [http://mappings.dbpedia.org/server/ontology/classes/RouteOfTransportation RouteOfTransportation]
    ***** [http://mappings.dbpedia.org/server/ontology/classes/RailwayLine RailwayLine]
    ***** [http://mappings.dbpedia.org/server/ontology/classes/Road Road]
    **** [http://mappings.dbpedia.org/server/ontology/classes/Station Station]
    ***** [http://mappings.dbpedia.org/server/ontology/classes/RailwayStation RailwayStation]
    * [http://mappings.dbpedia.org/server/ontology/classes/PopulatedPlace PopulatedPlace]
    ** [http://mappings.dbpedia.org/server/ontology/classes/AdministrativeRegion AdministrativeRegion]
    ** [http://mappings.dbpedia.org/server/ontology/classes/Continent Continent]
    ** [http://mappings.dbpedia.org/server/ontology/classes/Country Country]
    ** [http://mappings.dbpedia.org/server/ontology/classes/Island Island]
    ** [http://mappings.dbpedia.org/server/ontology/classes/Settlement Settlement]
    *** [http://mappings.dbpedia.org/server/ontology/classes/City City]
    *** [http://mappings.dbpedia.org/server/ontology/classes/Town Town]
    *** [http://mappings.dbpedia.org/server/ontology/classes/Village Village]
    == DBpedia datasets ==
    * '''[http://wiki.dbpedia.org/Datasets DBpedia datasets]''' have been released annually, sometimes with improvements more frequently. The latest (as of July 2018) [https://wiki.dbpedia.org/develop/datasets/dbpedia-version-2016-10 DBpedia dataset version is 2016-10] which was published in 2017.
    == DBpedia Databus ==
    At DBpedia there is ongoing work on what is called '''[https://wiki.dbpedia.org/blog/dbpedia-databus-%E2%80%93-transforming-linked-data-networked-data-economy DBpedia Databus]''' to take their game to the next level. Databus Alpha was published in May 2018.
    == History of DBpedia ==
    DBpedia began as an effort to extract structured information from [[Wikipedia]] [[templates|infobox templates]] and [[categories]] and to make this information available on the Web with the initial release on January 10th 2007.
    == More info on DBpedia  ==
    * [https://wiki.dbpedia.org/ DBpedia wiki]
    * [https://blog.dbpedia.org/ DBpedia blog]
    * [https://github.com/dbpedia/ DBpedia code at GitHub]
    ----


    = Wikidata =
    = Wikidata =
    Line 115: Line 53:
    * [[w:Wikipedia:Wikidata|Wikipedia advice on Wikidata issues]]
    * [[w:Wikipedia:Wikidata|Wikipedia advice on Wikidata issues]]


    = Wikibase =
    == Wikibase ==
    [[File:Wikibase_logo.png|thumb|right|250px|The [http://wikiba.se/ Wikibase] logo]]
    [[File:Wikibase_logo.png|thumb|right|250px|The [http://wikiba.se/ Wikibase] logo]]
    '''[[mw:Wikibase|Wikibase]]''' [http://wikiba.se/ (wikiba.se)] is a system for storing and querying structured data that powers [[Wikidata]] and other wikis.
    '''[[mw:Wikibase|Wikibase]]''' [http://wikiba.se/ (wikiba.se)] is a system for storing and querying structured data that powers [[Wikidata]] and other wikis.
    Line 124: Line 62:
    :# [[mw:Extension:Wikibase Client|Wikibase Client]] that allows a wiki to access structured data from a repository. The client can work only with repository databases it can access so they must be on the same machine or the same load balancer.
    :# [[mw:Extension:Wikibase Client|Wikibase Client]] that allows a wiki to access structured data from a repository. The client can work only with repository databases it can access so they must be on the same machine or the same load balancer.


    == Installation of Wikibase ==
    === Installation of Wikibase ===
    [[mw:Wikibase/Installation|Wikibase installation instructions at Mediawiki.org]] and [[mw:Wikibase/Installation/Advanced_configuration|advanced configuration of Wikibase]].  
    [[mw:Wikibase/Installation|Wikibase installation instructions at Mediawiki.org]] and [[mw:Wikibase/Installation/Advanced_configuration|advanced configuration of Wikibase]].  


    The installation instructions assume you are installing the dependencies with [[mw:Composer|Composer]], a PHP package manager that makes the installation of dependencies easy.
    The installation instructions assume you are installing the dependencies with [[mw:Composer|Composer]], a PHP package manager that makes the installation of dependencies easy.


    == Useful extensions in conjunction with Wikibase ==
    === Useful extensions in conjunction with Wikibase ===
    * [[mw:Extension:ArticlePlaceholder]] makes article placeholders from repository data and invites users to create the article.
    * [[mw:Extension:ArticlePlaceholder]] makes article placeholders from repository data and invites users to create the article.
    * [[mw:Extension:UniversalLanguageSelector]] is recommended to work in conjunction with Wikibase for user comfort.
    * [[mw:Extension:UniversalLanguageSelector]] is recommended to work in conjunction with Wikibase for user comfort.
    Line 136: Line 74:
    * [[#LinkedWiki extension]] can be configured to access multiple SPARQL endpoints.
    * [[#LinkedWiki extension]] can be configured to access multiple SPARQL endpoints.


    == Useful information ==
    === Useful information ===
    * [[mw:Wikibase/DataModel|The data model used in Wikibase]] (thorough) and [[mw:Wikibase/DataModel/Primer|the primer on the data model]] (quick access)
    * [[mw:Wikibase/DataModel|The data model used in Wikibase]] (thorough) and [[mw:Wikibase/DataModel/Primer|the primer on the data model]] (quick access)
    * [https://github.com/wikimedia/mediawiki-extensions-Wikibase/blob/master/docs/federation.wiki Information on the federation of Wikibase at Github]
    * [https://github.com/wikimedia/mediawiki-extensions-Wikibase/blob/master/docs/federation.wiki Information on the federation of Wikibase at Github]
    * [https://phabricator.wikimedia.org/T159240 Phabricator Task to "Document how to set up federated Wikibase instances"]
    * [https://phabricator.wikimedia.org/T159240 Phabricator Task to "Document how to set up federated Wikibase instances"]
    * [https://phabricator.wikimedia.org/T196997 Phabricator Task to "Wikidata/Wikibase federation and distribution"] and the [https://docs.google.com/document/d/1YYIuQzcWz2cH9zTUbbfiBrTecVxMWAu3_9-U4DcKzEU/edit#heading=h.z49pje934vur Google Docs document charting out the task]
    * [https://phabricator.wikimedia.org/T196997 Phabricator Task to "Wikidata/Wikibase federation and distribution"] and the [https://docs.google.com/document/d/1YYIuQzcWz2cH9zTUbbfiBrTecVxMWAu3_9-U4DcKzEU/edit#heading=h.z49pje934vur Google Docs document charting out the task]
    ----
    ----


    Line 159: Line 96:


    ----
    ----
    = Open Food Facts =
    {{:Open Food Facts}}
    ----
    = DBpedia =
    [[File:DBpediaLogo.svg|thumb|right|260px|The [https://www.dbpedia.org/ DBpedia] logo]]
    '''[[w:DBpedia|DBpedia]]''' [https://www.dbpedia.org/ (.org)] is a community effort to enable the web moving ''"Towards a Public Data Infrastructure for a Large, Multilingual, Semantic Knowledge Graph"''.
    Today the DBpedia '''[[w:data set|data set]]s''' contain a wealth of information structured into '''[[w:Ontology (information science)|ontologies]]'''. This [[w:structured data|structured data]] can be queried with [[SPARQL]] query language at their [http://dbpedia.org/sparql public DBpedia SPARQL endpoint].
    DBpedia uses [[w:Virtuoso Universal Server]] to store and query the data. [https://www.w3.org/wiki/VirtuosoUniversalServer VirtuosoUniversalServer at w3.org]
    There are many methods how the DBpedia ontology and datasets could be used in the Consumerium implementation stage wiki.
    == Ontology classes useful for implementing Consumerium ==
    '''[https://dief.tools.dbpedia.org/server/ontology/classes/ All DBpedia ontology classes]'''
    * [http://mappings.dbpedia.org/server/ontology/classes/Company Company]
    * [http://mappings.dbpedia.org/server/ontology/classes/Non-ProfitOrganisation Non-ProfitOrganisation]
    * [http://mappings.dbpedia.org/server/ontology/classes/Food Food]
    ** [http://mappings.dbpedia.org/server/ontology/classes/Beverage Beverage]
    * [http://mappings.dbpedia.org/server/ontology/classes/MeanOfTransportation MeanOfTransportation]
    ** [http://mappings.dbpedia.org/server/ontology/classes/Aircraft Aircraft]
    ** [http://mappings.dbpedia.org/server/ontology/classes/Automobile Automobile]
    ** [http://mappings.dbpedia.org/server/ontology/classes/Locomotive Locomotive]
    ** [http://mappings.dbpedia.org/server/ontology/classes/Ship Ship]


    * [http://mappings.dbpedia.org/server/ontology/classes/Place Place]
    ** [http://mappings.dbpedia.org/server/ontology/classes/ArchitecturalStructure ArchitecturalStructure]
    *** [http://mappings.dbpedia.org/server/ontology/classes/Building Building]
    **** [http://mappings.dbpedia.org/server/ontology/classes/Hotel Hotel]
    **** [http://mappings.dbpedia.org/server/ontology/classes/Restaurant Restaurant]
    **** [http://mappings.dbpedia.org/server/ontology/classes/ShoppingMall ShoppingMall]
    *** [http://mappings.dbpedia.org/server/ontology/classes/Infrastructure Infrastructure]
    **** [http://mappings.dbpedia.org/server/ontology/classes/Airport Airport]
    **** [http://mappings.dbpedia.org/server/ontology/classes/RouteOfTransportation RouteOfTransportation]
    ***** [http://mappings.dbpedia.org/server/ontology/classes/RailwayLine RailwayLine]
    ***** [http://mappings.dbpedia.org/server/ontology/classes/Road Road]
    **** [http://mappings.dbpedia.org/server/ontology/classes/Station Station]
    ***** [http://mappings.dbpedia.org/server/ontology/classes/RailwayStation RailwayStation]
    * [http://mappings.dbpedia.org/server/ontology/classes/PopulatedPlace PopulatedPlace]
    ** [http://mappings.dbpedia.org/server/ontology/classes/AdministrativeRegion AdministrativeRegion]
    ** [http://mappings.dbpedia.org/server/ontology/classes/Continent Continent]
    ** [http://mappings.dbpedia.org/server/ontology/classes/Country Country]
    ** [http://mappings.dbpedia.org/server/ontology/classes/Island Island]
    ** [http://mappings.dbpedia.org/server/ontology/classes/Settlement Settlement]
    *** [http://mappings.dbpedia.org/server/ontology/classes/City City]
    *** [http://mappings.dbpedia.org/server/ontology/classes/Town Town]
    *** [http://mappings.dbpedia.org/server/ontology/classes/Village Village]
    == DBpedia datasets ==
    * '''[http://wiki.dbpedia.org/Datasets DBpedia datasets]''' have been released annually, sometimes with improvements more frequently. The latest (as of July 2018) [https://wiki.dbpedia.org/develop/datasets/dbpedia-version-2016-10 DBpedia dataset version is 2016-10] which was published in 2017.
    == DBpedia Databus ==
    The [https://databus.dbpedia.org/ '''DBpedia Databus''' at databus.dbpedia.org] ''is a data cataloging and versioning platform for data developers and consumers.''
    [https://databus.dbpedia.org/sparql '''SPARQL endpoint''' at databus.dbpedia.org]
    DBpedia developed the DBpedia Databus in the late 2010's with a Databus alpha published in May 2018.
    == History of DBpedia ==
    DBpedia began as an effort to extract structured information from [[Wikipedia]] [[templates|infobox templates]] and [[categories]] and to make this information available on the Web with the initial release on January 10th 2007.
    == More info on DBpedia ==
    * [https://wiki.dbpedia.org/ DBpedia wiki]
    * [https://blog.dbpedia.org/ DBpedia blog]
    * [https://github.com/dbpedia/ DBpedia code at GitHub]
    ----
    = LinkedWiki extension =
    = LinkedWiki extension =
    [[File:LogoLinkedWiki.png|thumb|right|260px|Logo of the [[mw:Extension:LinkedWiki|LinkedWiki extension]]]]
    [[File:LogoLinkedWiki.png|thumb|right|260px|Logo of the [[mw:Extension:LinkedWiki|LinkedWiki extension]]]]
    A possible way to tap into various knowledge graphs is the [[mw:Extension:LinkedWiki|LinkedWiki extension]]. LinkedWiki has been developed since 2010 by [[mw:User:Karima Rafes]], a [[#Semantic MediaWiki]] developer and CEO of [http://www.bordercloud.com/ BorderCloud.com]
    A possible way to tap into various knowledge graphs is the [[mw:Extension:LinkedWiki|LinkedWiki extension]]. LinkedWiki has been developed since 2010 by [[mw:User:Karima Rafes]], a [[#Semantic MediaWiki]] developer and CEO of [http://www.bordercloud.com/ BorderCloud.com]


    = See also =
    ''' See also '''
    * [https://linkedwiki.com/ LinkedWiki.com - The best place to discover available Linked Data and to share SPARQL queries]
    * [https://linkedwiki.com/ LinkedWiki.com - The best place to discover available Linked Data and to share SPARQL queries]
    * [[Database]] for a higher level view of what is going on with the databases
    * [[Database]] for a higher level view of what is going on with the databases

    Latest revision as of 18:45, 4 January 2024

    Acquiring access for our consumers to a semantic network of relevant linked open data compiled by other efforts and structured by a number of ontologies is obviously key to Consumerium. Reciprocally we aim to share the information we gather and compile available to other efforts.

    See also: mw:Manual:Managing data in MediaWiki at mediawiki.org

    Wikidata[edit | edit source]

    The Wikidata logo

    Wikidata (.org) is a knowledge base, an effort to store and serve structured data to Wikimedia wikis and to a more limited extent to other parties. Wikidata effort saw the daylight in 2012.

    The underlying software is the Wikibase which consists of 2 Mediawiki extensions, the Wikibase Repository and the Wikibase Client.

    Wikibase allows interwiki links to be managed with Wikidata removing much contributor annoyanges, redundancy and error-proneness.

    Wikidata is obviously a very viable source of reference level data once it is technically possible for non-WMF wikis to access the data items. (See #LinkedWiki extension for a potential workaround for this limitation)

    It can be accessed outside of WMF wikis with with


    Main entry point of any Wikidata item is a JSON dictionary, that has this form:

    {“labels”: by-language dictionary

    “descriptions”: by-language dictionary

    “aliases”: by-language dictionary

    “claims”: list of property and values

    “sitelinks”: by-language dictionary}

    Lexicographical Wikidata[edit | edit source]

    A lexeme is a unit of lexical meaning that underlies a set of words that are related through inflection. It is a basic abstract unit of meaning,[1] a unit of morphological analysis in w:linguistics that roughly corresponds to a set of forms taken by a single root w:word. For example, in English, run, runs, ran and running are forms of the same lexeme, which can be represented as RUN (Wikipedia on 2019-12-29)

    Since 2018, Wikidata has also stored a new type of data: words, phrases and sentences, in many languages, described in many languages. This information is stored in new types of entities, called Lexemes (L), Forms (F) and Senses (S).[2]. This is enabled by the WikibaseLexeme extension.

    Useful information[edit | edit source]

    More info

    Wikibase[edit | edit source]

    The Wikibase logo

    Wikibase (wikiba.se) is a system for storing and querying structured data that powers Wikidata and other wikis.

    Wikibase consists of two extensions:

    1. Wikibase Repository that allows a wiki to work as a repository for structured data.
    2. Wikibase Client that allows a wiki to access structured data from a repository. The client can work only with repository databases it can access so they must be on the same machine or the same load balancer.

    Installation of Wikibase[edit | edit source]

    Wikibase installation instructions at Mediawiki.org and advanced configuration of Wikibase.

    The installation instructions assume you are installing the dependencies with Composer, a PHP package manager that makes the installation of dependencies easy.

    Useful extensions in conjunction with Wikibase[edit | edit source]

    Alternative to using Wikibase Client[edit | edit source]

    Useful information[edit | edit source]


    Semantic MediaWiki[edit | edit source]

    Semantic MediaWiki (.org) (SMW) is a free, open-source extension to MediaWiki that lets you store & query semantic data within the wiki and it seems well suited to Consumerium's information infrastructure needs.

    Spinoff extensions[edit | edit source]

    A variety of open-source MediaWiki extensions exist that use the data structure provided by Semantic MediaWiki.

    Among the most notable are of the Semantic MediaWiki extensions:

    • Semantic Forms - enables user-created forms for adding and editing pages that use semantic data
    • Semantic Result Formats - provides a large number of display formats for semantic data, including charts, graphs, calendars and mathematical functions
    • Semantic Drilldown - provides a w:faceted browser interface for viewing the semantic data in a wiki
    • Semantic Maps - displays geographic semantic data using various mapping services

    Open Food Facts[edit | edit source]

    Open Food Facts logo 2022

    Open Food Facts (.org) is a crowdsourced non-profit open database, web app and mobile apps of food products initiated in 2012 by French programmer Stéphane Gigandet. It is limited to foodstuffs only by the contract, others will be removed and clearly seeks not to be political.

    The database is copylefted under w:Open Database License, database entries under Database Contents License and the uploaded photos under CC-BY-3.0 Creative Commons so the database and the photos can be reused in other services so it would seem a good place to contribute as open data is available in various formats.

    w:Nutri-score is a nutritional grading system adopted by the French government in 2017. This is one type of information found in the Open Food Facts database because it is on the label. Nutri-score can be approximately calculated for also for products that do not have it on the packaging.

    Information verification is based on photos of the labeling on the package. Adding information not featured on the label seems to be forbidden, probably to avoid legal problems arising.

    In Wikipedia

    Apps

    Sites by OFF

    Interesting wiki articles

    Sites reusing OFF data set


    Spinoffs / expansions of scope


    DBpedia[edit | edit source]

    The DBpedia logo

    DBpedia (.org) is a community effort to enable the web moving "Towards a Public Data Infrastructure for a Large, Multilingual, Semantic Knowledge Graph".

    Today the DBpedia data sets contain a wealth of information structured into ontologies. This structured data can be queried with SPARQL query language at their public DBpedia SPARQL endpoint.

    DBpedia uses w:Virtuoso Universal Server to store and query the data. VirtuosoUniversalServer at w3.org

    There are many methods how the DBpedia ontology and datasets could be used in the Consumerium implementation stage wiki.

    Ontology classes useful for implementing Consumerium[edit | edit source]

    All DBpedia ontology classes

    DBpedia datasets[edit | edit source]

    DBpedia Databus[edit | edit source]

    The DBpedia Databus at databus.dbpedia.org is a data cataloging and versioning platform for data developers and consumers.

    SPARQL endpoint at databus.dbpedia.org

    DBpedia developed the DBpedia Databus in the late 2010's with a Databus alpha published in May 2018.

    History of DBpedia[edit | edit source]

    DBpedia began as an effort to extract structured information from Wikipedia infobox templates and categories and to make this information available on the Web with the initial release on January 10th 2007.

    More info on DBpedia[edit | edit source]


    LinkedWiki extension[edit | edit source]

    A possible way to tap into various knowledge graphs is the LinkedWiki extension. LinkedWiki has been developed since 2010 by mw:User:Karima Rafes, a #Semantic MediaWiki developer and CEO of BorderCloud.com

    See also

    References[edit | edit source]

    1. The Cambridge Encyclopedia of The English Language. Ed. [[w:David Crystal|]]. Cambridge: Cambridge University Press, 1995. p. 118. Template:ISBN.
    2. wikibooks:SPARQL/WIKIDATA Lexicographical data