Latest revision |
Your text |
Line 1: |
Line 1: |
| Acquiring access for our consumers to a [[w:semantic network|semantic network]] of relevant [[w:linked data|linked]] [[w:open data|open data]] compiled by other efforts and structured by a number of [[w:ontology (information science)|ontologies]] is obviously key to Consumerium. Reciprocally we aim to share the information we gather and compile available to other efforts.
| |
|
| |
| See also: [[mw:Manual:Managing data in MediaWiki]] at mediawiki.org
| |
|
| |
| = Wikidata =
| |
| [[File:Wikidata-logo-en.svg|thumb|right|260px|The [https://wikidata.org Wikidata] logo]]
| |
|
| |
| '''[[w:Wikidata|Wikidata]]''' [https://wikidata.org (.org)] is a [[w:knowledge base|knowledge base]], an effort to store and serve structured data to [[Wikimedia]] wikis and to a more limited extent to other parties. Wikidata effort saw the daylight in [[2012]].
| |
|
| |
| The underlying software is the '''[[#Wikibase|Wikibase]]''' which consists of 2 [[Mediawiki]] extensions, the [[mw:Extension:Wikibase Repository|Wikibase Repository]] and the [[mw:Extension:Wikibase Client|Wikibase Client]].
| |
|
| |
| Wikibase allows [[interwiki]] links to be managed with Wikidata removing much contributor annoyanges, redundancy and error-proneness.
| |
|
| |
| Wikidata is obviously a very viable source of [[reference]] level data once it is technically possible for non-WMF wikis to access the data items. (See [[#LinkedWiki extension]] for a potential workaround for this limitation)
| |
|
| |
| It can be accessed outside of WMF wikis with with
| |
| * [https://query.wikidata.org/ Wikidata's SPARQL endpoint] using [[SPARQL]] ([[wikidata:Wikidata:SPARQL_query_service/queries|Wikidata advice on how to query]]).
| |
| * [[mw:Wikidata Toolkit]] is a way for Java programs to access data in the Wikidata repository.
| |
| * [https://rdflib.github.io/sparqlwrapper/ SPARQL Endpoint interface to Python]
| |
| * [[wikidata:Wikidata:Database download|Wikidata database download]]
| |
|
| |
| * The [https://tools.wmflabs.org/wikidata-game/# Wikidata Game] [[w:gamification|gamifies]] adding information to Wikidata.
| |
|
| |
|
| |
| Main entry point of any Wikidata item is a [[w:JSON|JSON]] dictionary, that has this form:
| |
|
| |
| <code>{“labels”: by-language dictionary
| |
|
| |
| “descriptions”: by-language dictionary
| |
|
| |
| “aliases”: by-language dictionary
| |
|
| |
| “claims”: list of property and values
| |
|
| |
| “sitelinks”: by-language dictionary}</code>
| |
|
| |
| == Lexicographical Wikidata ==
| |
|
| |
| A '''lexeme''' is a unit of [[w:lexical semantics|lexical]] meaning that underlies a set of words that are related through [[w:inflection|inflection]]. It is a basic abstract unit of meaning,<ref>''The Cambridge Encyclopedia of The English Language''. Ed. [[w:David Crystal|]]. Cambridge: Cambridge University Press, 1995. p. 118. {{ISBN|0521401798}}.</ref> a [[w:emic unit|unit]] of [[w:Morphology (linguistics)|morphological]] [[w:Semantic analysis (linguistics)|analysis]] in [[w:linguistics]] that roughly corresponds to a set of forms taken by a single root [[w:word]]. For example, in [[w:English language|English]], ''run'', ''runs'', ''ran'' and ''running'' are forms of the same lexeme, which can be represented as <span style="font-variant:small-caps; text-transform:lowercase;">RUN</span> (Wikipedia on 2019-12-29)
| |
|
| |
| Since 2018, Wikidata has also stored a new type of data: words, phrases and sentences, in many languages, described in many languages. This information is stored in new types of entities, called '''Lexemes''' ('''L'''), '''Forms''' ('''F''') and '''Senses''' ('''S''').<ref>[[wikibooks:SPARQL/WIKIDATA Lexicographical data]]</ref>. This is enabled by [[mw:Extension:WikibaseLexeme|the WikibaseLexeme extension]].
| |
|
| |
| * [[wikidata:Wikidata:Lexicographical_data/Documentation|The main documentation page for lexicographical data on Wikidata]]
| |
| * [[wikidata:Wikidata:Tools/Lexicographical data|Wikidata's list of lexicographical properties and tools using the lexicographical data]]
| |
| * [[wikibooks:SPARQL/WIKIDATA Lexicographical data|Wikibook on Wikidata's lexicographical data]]
| |
|
| |
| == Useful information ==
| |
|
| |
| * [[wikidata:Wikidata:Tools/External_tools|Wikidata's list of external tools that make use of Wikidata's knowledge]]
| |
|
| |
| '''More info'''
| |
| * [[m:Wikidata|Metawiki on Wikidata]]
| |
| * [[w:Wikipedia:Wikidata|Wikipedia advice on Wikidata issues]]
| |
|
| |
| == Wikibase ==
| |
| [[File:Wikibase_logo.png|thumb|right|250px|The [http://wikiba.se/ Wikibase] logo]]
| |
| '''[[mw:Wikibase|Wikibase]]''' [http://wikiba.se/ (wikiba.se)] is a system for storing and querying structured data that powers [[Wikidata]] and other wikis.
| |
|
| |
| Wikibase consists of two extensions:
| |
|
| |
| :# [[mw:Extension:Wikibase Repository|Wikibase Repository]] that allows a wiki to work as a repository for structured data.
| |
| :# [[mw:Extension:Wikibase Client|Wikibase Client]] that allows a wiki to access structured data from a repository. The client can work only with repository databases it can access so they must be on the same machine or the same load balancer.
| |
|
| |
| === Installation of Wikibase ===
| |
| [[mw:Wikibase/Installation|Wikibase installation instructions at Mediawiki.org]] and [[mw:Wikibase/Installation/Advanced_configuration|advanced configuration of Wikibase]].
| |
|
| |
| The installation instructions assume you are installing the dependencies with [[mw:Composer|Composer]], a PHP package manager that makes the installation of dependencies easy.
| |
|
| |
| === Useful extensions in conjunction with Wikibase ===
| |
| * [[mw:Extension:ArticlePlaceholder]] makes article placeholders from repository data and invites users to create the article.
| |
| * [[mw:Extension:UniversalLanguageSelector]] is recommended to work in conjunction with Wikibase for user comfort.
| |
|
| |
| == Alternative to using Wikibase Client ==
| |
| * [[#LinkedWiki extension]] can be configured to access multiple SPARQL endpoints.
| |
|
| |
| === Useful information ===
| |
| * [[mw:Wikibase/DataModel|The data model used in Wikibase]] (thorough) and [[mw:Wikibase/DataModel/Primer|the primer on the data model]] (quick access)
| |
| * [https://github.com/wikimedia/mediawiki-extensions-Wikibase/blob/master/docs/federation.wiki Information on the federation of Wikibase at Github]
| |
| * [https://phabricator.wikimedia.org/T159240 Phabricator Task to "Document how to set up federated Wikibase instances"]
| |
| * [https://phabricator.wikimedia.org/T196997 Phabricator Task to "Wikidata/Wikibase federation and distribution"] and the [https://docs.google.com/document/d/1YYIuQzcWz2cH9zTUbbfiBrTecVxMWAu3_9-U4DcKzEU/edit#heading=h.z49pje934vur Google Docs document charting out the task]
| |
| ----
| |
|
| |
| = Semantic MediaWiki =
| |
| [[File:SemanticMediaWiki_Logo.png|thumb|right|200px|The [https://www.semantic-mediawiki.org/wiki/Semantic_MediaWiki Semantic MediaWiki] logo]]
| |
|
| |
| '''Semantic MediaWiki''' [https://www.semantic-mediawiki.org/wiki/Semantic_MediaWiki (.org)] (SMW) is a free, open-source [[MediaWiki extensions|extension]] to [[MediaWiki]] that lets you store & query semantic data within the wiki and it seems well suited to [[Consumerium]]'s information infrastructure needs.
| |
|
| |
| == Spinoff extensions ==
| |
| A variety of open-source MediaWiki extensions exist that use the data structure provided by Semantic MediaWiki.
| |
|
| |
| Among the most notable are of the [http://semantic-mediawiki.org/wiki/Help:SMW_extensions Semantic MediaWiki extensions]:
| |
| * [[mw:Extension:Semantic Forms|Semantic Forms]] - enables user-created forms for adding and editing pages that use semantic data
| |
| * [[mw:Extension:Semantic Result Formats|Semantic Result Formats]] - provides a large number of display formats for semantic data, including charts, graphs, calendars and mathematical functions
| |
| * [[mw:Extension:Semantic Drilldown|Semantic Drilldown]] - provides a [[w:faceted browser]] interface for viewing the semantic data in a wiki
| |
| * [[mw:Extension:Semantic Maps|Semantic Maps]] - displays geographic semantic data using various mapping services
| |
|
| |
| ----
| |
| = Open Food Facts =
| |
| {{:Open Food Facts}}
| |
| ----
| |
| = DBpedia = | | = DBpedia = |
| [[File:DBpediaLogo.svg|thumb|right|260px|The [https://www.dbpedia.org/ DBpedia] logo]]
| | '''[[w:DBpedia|DBpedia]]''' [http://dbpedia.org (.org)] is a community effort to enable the web moving ''"Towards a Public Data Infrastructure for a Large, Multilingual, Semantic Knowledge Graph"''. |
| '''[[w:DBpedia|DBpedia]]''' [https://www.dbpedia.org/ (.org)] is a community effort to enable the web moving ''"Towards a Public Data Infrastructure for a Large, Multilingual, Semantic Knowledge Graph"''. | |
|
| |
|
| Today the DBpedia '''[[w:data set|data set]]s''' contain a wealth of information structured into '''[[w:Ontology (information science)|ontologies]]'''. This [[w:structured data|structured data]] can be queried with [[SPARQL]] query language at their [http://dbpedia.org/sparql public DBpedia SPARQL endpoint]. | | Today the DBpedia '''[[w:data set|data set]]s''' contain a wealth of information structured into '''[[w:Ontology (information science)|ontologies]]'''. This [[w:structured data|structured data]] can be queried with [[SPARQL]] query language at their [http://dbpedia.org/sparql public DBpedia SPARQL endpoint]. |
|
| |
|
| DBpedia uses [[w:Virtuoso Universal Server]] to store and query the data. [https://www.w3.org/wiki/VirtuosoUniversalServer VirtuosoUniversalServer at w3.org]
| | There are many methods how the DBpedia ontology and datasets could be used in the Consumerium implementation stage wiki. One of these would be to install the [[mw:Extension:LinkedWiki|LinkedWiki extension]]. |
| | |
| There are many methods how the DBpedia ontology and datasets could be used in the Consumerium implementation stage wiki. | |
|
| |
|
| == Ontology classes useful for implementing Consumerium == | | == Ontology classes useful for implementing Consumerium == |
| '''[https://dief.tools.dbpedia.org/server/ontology/classes/ All DBpedia ontology classes]''' | | '''[http://mappings.dbpedia.org/server/ontology/classes/ All DBpedia ontology classes]''' |
|
| |
|
| * [http://mappings.dbpedia.org/server/ontology/classes/Company Company] | | * [http://mappings.dbpedia.org/server/ontology/classes/Company Company] |
Line 150: |
Line 47: |
| * '''[http://wiki.dbpedia.org/Datasets DBpedia datasets]''' have been released annually, sometimes with improvements more frequently. The latest (as of July 2018) [https://wiki.dbpedia.org/develop/datasets/dbpedia-version-2016-10 DBpedia dataset version is 2016-10] which was published in 2017. | | * '''[http://wiki.dbpedia.org/Datasets DBpedia datasets]''' have been released annually, sometimes with improvements more frequently. The latest (as of July 2018) [https://wiki.dbpedia.org/develop/datasets/dbpedia-version-2016-10 DBpedia dataset version is 2016-10] which was published in 2017. |
|
| |
|
| == DBpedia Databus == | | == Databus == |
| The [https://databus.dbpedia.org/ '''DBpedia Databus''' at databus.dbpedia.org] ''is a data cataloging and versioning platform for data developers and consumers.''
| | At DBpedia there is ongoing work on what is called '''[https://wiki.dbpedia.org/blog/dbpedia-databus-%E2%80%93-transforming-linked-data-networked-data-economy DBpedia Databus]''' to take their game to the next level. Databus Alpha was published in May 2018. |
|
| |
|
| [https://databus.dbpedia.org/sparql '''SPARQL endpoint''' at databus.dbpedia.org]
| | == History of DBpedia == |
| | |
| DBpedia developed the DBpedia Databus in the late 2010's with a Databus alpha published in May 2018. | |
|
| |
|
| == History of DBpedia ==
| |
| DBpedia began as an effort to extract structured information from [[Wikipedia]] [[templates|infobox templates]] and [[categories]] and to make this information available on the Web with the initial release on January 10th 2007. | | DBpedia began as an effort to extract structured information from [[Wikipedia]] [[templates|infobox templates]] and [[categories]] and to make this information available on the Web with the initial release on January 10th 2007. |
| | | |
| == More info on DBpedia == | | == More info on DBpedia == |
| * [https://wiki.dbpedia.org/ DBpedia wiki] | | * [https://wiki.dbpedia.org/ DBpedia wiki] |
| * [https://blog.dbpedia.org/ DBpedia blog] | | * [https://blog.dbpedia.org/ DBpedia blog] |
| * [https://github.com/dbpedia/ DBpedia code at GitHub] | | * [https://github.com/dbpedia/ DBpedia code at GitHub] |
| | |
| | == See also == |
| | * [[Database]] for a higher level view of what is going on with the databases |
| | * [[Semantic MediaWiki]] is system for inputting and querying semantic data within the MediaWiki and it is implemented as extension(s). |
| | * [[Wikidata]] and the underlying [[Wikibase|Wikibase extensions]] |
|
| |
|
| ---- | | ---- |
| = LinkedWiki extension = | | = Wikidata = |
| [[File:LogoLinkedWiki.png|thumb|right|260px|Logo of the [[mw:Extension:LinkedWiki|LinkedWiki extension]]]] | | |
| A possible way to tap into various knowledge graphs is the [[mw:Extension:LinkedWiki|LinkedWiki extension]]. LinkedWiki has been developed since 2010 by [[mw:User:Karima Rafes]], a [[#Semantic MediaWiki]] developer and CEO of [http://www.bordercloud.com/ BorderCloud.com]
| | '''[[w:Wikidata|Wikidata]]''' [http://wikidata.org (.org)] is a [[w:knowledge base|knowledge base]], an effort to store and serve structured data to [[Wikimedia]] wikis and to a more limited extent to other parties. Wikidata effort saw the daylight in [[2012]]. |
| | |
| | The underlying software is the '''[[Wikibase]]''' which consists of 2 [[Mediawiki]] extensions, the repository extension and the client extension. |
| | |
| | Wikibase allows [[interwiki]] links to be managed with Wikidata removing much contributor annoyanges, redundancy and error-proneness. |
| | |
| | Wikidata is obviously a main source of [[reference]] level data once it is technically possible for non-WMF wikis to access the data items. |
| | |
| | It can be accessed outside of WMF wikis with with |
| | * [[wikidata:Wikidata:SPARQL_query_service/queries|Wikidata advice on how to query]] [https://query.wikidata.org/ Wikidata's endpoint] using [[SPARQL]]. |
| | * [[mw:Wikidata Toolkit]] is a way for Java programs to access data in the Wikidata repository. |
| | * [https://rdflib.github.io/sparqlwrapper/ SPARQL Endpoint interface to Python] |
| | * [[wikidata:Wikidata:Database download|Wikidata offers its database for download]] |
| | |
| | Main entry point of any Wikidata item is a [[w:JSON|JSON]] dictionary, that has this form: |
| | |
| | <code>{“labels”: by-language dictionary |
| | |
| | “descriptions”: by-language dictionary |
| | |
| | “aliases”: by-language dictionary |
| | |
| | “claims”: list of property and values |
| | |
| | “sitelinks”: by-language dictionary}</code> |
|
| |
|
| ''' See also '''
| | == Useful information == |
| * [https://linkedwiki.com/ LinkedWiki.com - The best place to discover available Linked Data and to share SPARQL queries] | | |
| * [[Database]] for a higher level view of what is going on with the databases
| | * [[wikidata:Wikidata:Tools/External_tools|Wikidata's list of external tools that make use of Wikidata's knowledge]] |
|
| |
|
| = References =
| | '''More info''' |
| <references/>
| | * [[m:Wikidata|Metawiki on Wikidata]] |
| | * [[w:Wikipedia:Wikidata|Wikipedia advice on Wikidata issues]] |
|
| |
|
| [[Category:List]] | | == See also == |
| [[Category:Technology]] | | * [[Database]] for a higher level view of what is going on with the databases |
| | * [[Semantic MediaWiki]] is system for inputting and querying semantic data within the MediaWiki and it is implemented as extension(s). |
| | * [[DBpedia]] the original structured data harvesting effort for the MediaWiki wikis |