Editing Knowledge graphs

Warning: You are not logged in. Your IP address will be publicly visible if you make any edits. If you log in or create an account, your edits will be attributed to your username, along with other benefits.

The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then publish the changes below to finish undoing the edit.

Latest revision Your text
Line 1: Line 1:
Acquiring access for our consumers to a [[w:semantic network|semantic network]] of relevant [[w:linked data|linked]] [[w:open data|open data]] compiled by other efforts and structured by a number of [[w:ontology (information science)|ontologies]] is obviously key to Consumerium. Reciprocally we aim to share the information we gather and compile available to other efforts.
See also: [[mw:Manual:Managing data in MediaWiki]] at mediawiki.org
= Wikidata =
[[File:Wikidata-logo-en.svg|thumb|right|260px|The [https://wikidata.org Wikidata] logo]]
'''[[w:Wikidata|Wikidata]]''' [https://wikidata.org (.org)] is a [[w:knowledge base|knowledge base]], an effort to store and serve structured data to [[Wikimedia]] wikis and to a more limited extent to other parties. Wikidata effort saw the daylight in [[2012]].
The underlying software is the '''[[#Wikibase|Wikibase]]''' which consists of 2 [[Mediawiki]] extensions, the [[mw:Extension:Wikibase Repository|Wikibase Repository]] and the [[mw:Extension:Wikibase Client|Wikibase Client]].
Wikibase allows [[interwiki]] links to be managed with Wikidata removing much contributor annoyanges, redundancy and error-proneness.
Wikidata is obviously a very viable source of [[reference]] level data once it is technically possible for non-WMF wikis to access the data items. (See [[#LinkedWiki extension]] for a potential workaround for this limitation)
It can be accessed outside of WMF wikis with with
* [https://query.wikidata.org/ Wikidata's SPARQL endpoint] using [[SPARQL]] ([[wikidata:Wikidata:SPARQL_query_service/queries|Wikidata advice on how to query]]).
* [[mw:Wikidata Toolkit]] is a way for Java programs to access data in the Wikidata repository.
* [https://rdflib.github.io/sparqlwrapper/ SPARQL Endpoint interface to Python]
* [[wikidata:Wikidata:Database download|Wikidata database download]]
* The [https://tools.wmflabs.org/wikidata-game/# Wikidata Game] [[w:gamification|gamifies]] adding information to Wikidata.
Main entry point of any Wikidata item is a [[w:JSON|JSON]] dictionary, that has this form:
<code>{“labels”: by-language dictionary
“descriptions”: by-language dictionary
“aliases”: by-language dictionary
“claims”: list of property and values
“sitelinks”: by-language dictionary}</code>
== Lexicographical Wikidata ==
A '''lexeme''' is a unit of [[w:lexical semantics|lexical]] meaning that underlies a set of words that are related through [[w:inflection|inflection]]. It is a basic abstract unit of meaning,<ref>''The Cambridge Encyclopedia of The English Language''. Ed. [[w:David Crystal|]]. Cambridge: Cambridge University Press, 1995. p.&nbsp;118. {{ISBN|0521401798}}.</ref> a [[w:emic unit|unit]] of [[w:Morphology (linguistics)|morphological]] [[w:Semantic analysis (linguistics)|analysis]] in [[w:linguistics]] that roughly corresponds to a set of forms taken by a single root [[w:word]]. For example, in [[w:English language|English]], ''run'', ''runs'', ''ran'' and ''running'' are forms of the same lexeme, which can be represented as <span style="font-variant:small-caps; text-transform:lowercase;">RUN</span> (Wikipedia on 2019-12-29)
Since 2018, Wikidata has also stored a new type of data: words, phrases and sentences, in many languages, described in many languages. This information is stored in new types of entities, called '''Lexemes''' ('''L'''), '''Forms''' ('''F''') and '''Senses''' ('''S''').<ref>[[wikibooks:SPARQL/WIKIDATA Lexicographical data]]</ref>. This is enabled by [[mw:Extension:WikibaseLexeme|the WikibaseLexeme extension]].
* [[wikidata:Wikidata:Lexicographical_data/Documentation|The main documentation page for lexicographical data on Wikidata]]
* [[wikidata:Wikidata:Tools/Lexicographical data|Wikidata's list of lexicographical properties and tools using the lexicographical data]]
* [[wikibooks:SPARQL/WIKIDATA Lexicographical data|Wikibook on Wikidata's lexicographical data]]
== Useful information ==
* [[wikidata:Wikidata:Tools/External_tools|Wikidata's list of external tools that make use of Wikidata's knowledge]]
'''More info'''
* [[m:Wikidata|Metawiki on Wikidata]]
* [[w:Wikipedia:Wikidata|Wikipedia advice on Wikidata issues]]
== Wikibase ==
[[File:Wikibase_logo.png|thumb|right|250px|The [http://wikiba.se/ Wikibase] logo]]
'''[[mw:Wikibase|Wikibase]]''' [http://wikiba.se/ (wikiba.se)] is a system for storing and querying structured data that powers [[Wikidata]] and other wikis.
Wikibase consists of two extensions:
:# [[mw:Extension:Wikibase Repository|Wikibase Repository]] that allows a wiki to work as a repository for structured data.
:# [[mw:Extension:Wikibase Client|Wikibase Client]] that allows a wiki to access structured data from a repository. The client can work only with repository databases it can access so they must be on the same machine or the same load balancer.
=== Installation of Wikibase ===
[[mw:Wikibase/Installation|Wikibase installation instructions at Mediawiki.org]] and [[mw:Wikibase/Installation/Advanced_configuration|advanced configuration of Wikibase]].
The installation instructions assume you are installing the dependencies with [[mw:Composer|Composer]], a PHP package manager that makes the installation of dependencies easy.
=== Useful extensions in conjunction with Wikibase ===
* [[mw:Extension:ArticlePlaceholder]] makes article placeholders from repository data and invites users to create the article.
* [[mw:Extension:UniversalLanguageSelector]] is recommended to work in conjunction with Wikibase for user comfort.
== Alternative to using Wikibase Client ==
* [[#LinkedWiki extension]] can be configured to access multiple SPARQL endpoints.
=== Useful information ===
* [[mw:Wikibase/DataModel|The data model used in Wikibase]] (thorough) and [[mw:Wikibase/DataModel/Primer|the primer on the data model]] (quick access)
* [https://github.com/wikimedia/mediawiki-extensions-Wikibase/blob/master/docs/federation.wiki Information on the federation of Wikibase at Github]
* [https://phabricator.wikimedia.org/T159240 Phabricator Task to "Document how to set up federated Wikibase instances"]
* [https://phabricator.wikimedia.org/T196997 Phabricator Task to "Wikidata/Wikibase federation and distribution"] and the [https://docs.google.com/document/d/1YYIuQzcWz2cH9zTUbbfiBrTecVxMWAu3_9-U4DcKzEU/edit#heading=h.z49pje934vur Google Docs document charting out the task]
----
= Semantic MediaWiki =
[[File:SemanticMediaWiki_Logo.png|thumb|right|200px|The [https://www.semantic-mediawiki.org/wiki/Semantic_MediaWiki Semantic MediaWiki] logo]]
'''Semantic MediaWiki''' [https://www.semantic-mediawiki.org/wiki/Semantic_MediaWiki (.org)] (SMW) is a free, open-source [[MediaWiki extensions|extension]] to [[MediaWiki]] that lets you store & query semantic data within the wiki and it seems well suited to [[Consumerium]]'s information infrastructure needs.
== Spinoff extensions ==
A variety of open-source MediaWiki extensions exist that use the data structure provided by Semantic MediaWiki.
Among the most notable are of the [http://semantic-mediawiki.org/wiki/Help:SMW_extensions Semantic MediaWiki extensions]:
* [[mw:Extension:Semantic Forms|Semantic Forms]] - enables user-created forms for adding and editing pages that use semantic data
* [[mw:Extension:Semantic Result Formats|Semantic Result Formats]] - provides a large number of display formats for semantic data, including charts, graphs, calendars and mathematical functions
* [[mw:Extension:Semantic Drilldown|Semantic Drilldown]] - provides a [[w:faceted browser]] interface for viewing the semantic data in a wiki
* [[mw:Extension:Semantic Maps|Semantic Maps]] - displays geographic semantic data using various mapping services
----
= Open Food Facts =
{{:Open Food Facts}}
----
= DBpedia =
= DBpedia =
[[File:DBpediaLogo.svg|thumb|right|260px|The [https://www.dbpedia.org/ DBpedia] logo]]
'''[[w:DBpedia|DBpedia]]''' [http://dbpedia.org (.org)] is a community effort to enable the web moving ''"Towards a Public Data Infrastructure for a Large, Multilingual, Semantic Knowledge Graph"''.
'''[[w:DBpedia|DBpedia]]''' [https://www.dbpedia.org/ (.org)] is a community effort to enable the web moving ''"Towards a Public Data Infrastructure for a Large, Multilingual, Semantic Knowledge Graph"''.


Today the DBpedia '''[[w:data set|data set]]s''' contain a wealth of information structured into '''[[w:Ontology (information science)|ontologies]]'''. This [[w:structured data|structured data]] can be queried with [[SPARQL]] query language at their [http://dbpedia.org/sparql public DBpedia SPARQL endpoint].  
Today the DBpedia '''[[w:data set|data set]]s''' contain a wealth of information structured into '''[[w:Ontology (information science)|ontologies]]'''. This [[w:structured data|structured data]] can be queried with [[SPARQL]] query language at their [http://dbpedia.org/sparql public DBpedia SPARQL endpoint].  


DBpedia uses [[w:Virtuoso Universal Server]] to store and query the data. [https://www.w3.org/wiki/VirtuosoUniversalServer VirtuosoUniversalServer at w3.org]
There are many methods how the DBpedia ontology and datasets could be used in the Consumerium implementation stage wiki. One of these would be to install the [[mw:Extension:LinkedWiki|LinkedWiki extension]].
 
There are many methods how the DBpedia ontology and datasets could be used in the Consumerium implementation stage wiki.  


== Ontology classes useful for implementing Consumerium ==
== Ontology classes useful for implementing Consumerium ==
'''[https://dief.tools.dbpedia.org/server/ontology/classes/ All DBpedia ontology classes]'''
'''[http://mappings.dbpedia.org/server/ontology/classes/ All DBpedia ontology classes]'''


* [http://mappings.dbpedia.org/server/ontology/classes/Company Company]
* [http://mappings.dbpedia.org/server/ontology/classes/Company Company]
Line 150: Line 47:
* '''[http://wiki.dbpedia.org/Datasets DBpedia datasets]''' have been released annually, sometimes with improvements more frequently. The latest (as of July 2018) [https://wiki.dbpedia.org/develop/datasets/dbpedia-version-2016-10 DBpedia dataset version is 2016-10] which was published in 2017.
* '''[http://wiki.dbpedia.org/Datasets DBpedia datasets]''' have been released annually, sometimes with improvements more frequently. The latest (as of July 2018) [https://wiki.dbpedia.org/develop/datasets/dbpedia-version-2016-10 DBpedia dataset version is 2016-10] which was published in 2017.


== DBpedia Databus ==
== Databus ==
The [https://databus.dbpedia.org/ '''DBpedia Databus''' at databus.dbpedia.org] ''is a data cataloging and versioning platform for data developers and consumers.''
At DBpedia there is ongoing work on what is called '''[https://wiki.dbpedia.org/blog/dbpedia-databus-%E2%80%93-transforming-linked-data-networked-data-economy DBpedia Databus]''' to take their game to the next level. Databus Alpha was published in May 2018.


[https://databus.dbpedia.org/sparql '''SPARQL endpoint''' at databus.dbpedia.org]
== History of DBpedia ==
 
DBpedia developed the DBpedia Databus in the late 2010's with a Databus alpha published in May 2018.


== History of DBpedia ==
DBpedia began as an effort to extract structured information from [[Wikipedia]] [[templates|infobox templates]] and [[categories]] and to make this information available on the Web with the initial release on January 10th 2007.  
DBpedia began as an effort to extract structured information from [[Wikipedia]] [[templates|infobox templates]] and [[categories]] and to make this information available on the Web with the initial release on January 10th 2007.  
   
   
== More info on DBpedia ==
== More info on DBpedia ==
* [https://wiki.dbpedia.org/ DBpedia wiki]
* [https://wiki.dbpedia.org/ DBpedia wiki]
* [https://blog.dbpedia.org/ DBpedia blog]
* [https://blog.dbpedia.org/ DBpedia blog]
* [https://github.com/dbpedia/ DBpedia code at GitHub]
* [https://github.com/dbpedia/ DBpedia code at GitHub]
== See also ==
* [[Database]] for a higher level view of what is going on with the databases
* [[Semantic MediaWiki]] is system for inputting and querying semantic data within the MediaWiki and it is implemented as extension(s).
* [[Wikidata]] and the underlying [[Wikibase|Wikibase extensions]]


----
----
= LinkedWiki extension =
= Wikidata =
[[File:LogoLinkedWiki.png|thumb|right|260px|Logo of the [[mw:Extension:LinkedWiki|LinkedWiki extension]]]]
 
A possible way to tap into various knowledge graphs is the [[mw:Extension:LinkedWiki|LinkedWiki extension]]. LinkedWiki has been developed since 2010 by [[mw:User:Karima Rafes]], a [[#Semantic MediaWiki]] developer and CEO of [http://www.bordercloud.com/ BorderCloud.com]
'''[[w:Wikidata|Wikidata]]''' [http://wikidata.org (.org)] is a [[w:knowledge base|knowledge base]], an effort to store and serve structured data to [[Wikimedia]] wikis and to a more limited extent to other parties. Wikidata effort saw the daylight in [[2012]].
 
The underlying software is the '''[[Wikibase]]''' which consists of 2 [[Mediawiki]] extensions, the repository extension and the client extension.
 
Wikibase allows [[interwiki]] links to be managed with Wikidata removing much contributor annoyanges, redundancy and error-proneness.
 
Wikidata is obviously a main source of [[reference]] level data once it is technically possible for non-WMF wikis to access the data items.
 
It can be accessed outside of WMF wikis with with
* [[wikidata:Wikidata:SPARQL_query_service/queries|Wikidata advice on how to query]] [https://query.wikidata.org/ Wikidata's endpoint] using [[SPARQL]].
* [[mw:Wikidata Toolkit]] is a way for Java programs to access data in the Wikidata repository.
* [https://rdflib.github.io/sparqlwrapper/ SPARQL Endpoint interface to Python]
* [[wikidata:Wikidata:Database download|Wikidata offers its database for download]]
 
Main entry point of any Wikidata item is a [[w:JSON|JSON]] dictionary, that has this form:
 
<code>{“labels”: by-language dictionary
 
“descriptions”: by-language dictionary
 
“aliases”: by-language dictionary
 
“claims”: list of property and values
 
“sitelinks”: by-language dictionary}</code>


''' See also '''
== Useful information ==
* [https://linkedwiki.com/ LinkedWiki.com - The best place to discover available Linked Data and to share SPARQL queries]
 
* [[Database]] for a higher level view of what is going on with the databases
* [[wikidata:Wikidata:Tools/External_tools|Wikidata's list of external tools that make use of Wikidata's knowledge]]


= References =
'''More info'''
<references/>
* [[m:Wikidata|Metawiki on Wikidata]]
* [[w:Wikipedia:Wikidata|Wikipedia advice on Wikidata issues]]


[[Category:List]]
== See also ==
[[Category:Technology]]
* [[Database]] for a higher level view of what is going on with the databases
* [[Semantic MediaWiki]] is system for inputting and querying semantic data within the MediaWiki and it is implemented as extension(s).
* [[DBpedia]] the original structured data harvesting effort for the MediaWiki wikis
Please note that all contributions to Consumerium development wiki are considered to be released under the GNU Free Documentation License 1.3 or later (see Consumerium:Copyrights for details). If you do not want your writing to be edited mercilessly and redistributed at will, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource. Do not submit copyrighted work without permission!

To protect the wiki against automated edit spam, we kindly ask you to solve the following CAPTCHA:

Cancel Editing help (opens in new window)

Templates used on this page: