Systems selections: Difference between revisions

    From Consumerium development wiki R&D Wiki
    (→‎Data sources: + https://www.w3.org/wiki/SparqlEndpoints)
    Line 70: Line 70:


    === Data sources: Semantic web ===
    === Data sources: Semantic web ===
    * Other copyleft Semantic MediaWikis
    * Other copyleft Semantic MediaWikis  
    * '''[[DBpedia]]''' [http://dbpedia.org/sparql Public SPARQL endpoint of DBpedia.org]
    ** [http://smw.referata.com/wiki/Special:BrowseData/Sites Semantic MediaWiki Community wiki list of SMW sites]
    ** [https://wikiapiary.com/wiki/Semantic_statistics Wikiapiary list of SMW sites]
    ** [http://wikiindex.org/Category:Semantic_MediaWiki Wikiindex.org category of SMWs]
     
     
    * '''[[DBpedia]]''' [http://dbpedia.org/sparql Public SPARQL endpoint of DBpedia.org] and [http://wiki.dbpedia.org/projects Projects and efforts using DBpedia] are very relevant.
    * '''[[Wikidata]]''' [https://query.wikidata.org/ Public SPARQL endpoint of Wikidata.org (Wikidata Query Service)]
    * '''[[Wikidata]]''' [https://query.wikidata.org/ Public SPARQL endpoint of Wikidata.org (Wikidata Query Service)]



    Revision as of 12:37, 30 August 2016

    Factors affecting database systems selections

    • Copyleft 'free to modify' and 'free in cost' software is strongly preferred over other solutions.
    • Minimal modifications required could be another preference as that means maintenance of the chosen solution is minimized in that aspect.

    Encodings

    • Integers w:BASE10 for integers, size depends on available solutions. Escape integer ceiling by bluntly encoding the integer as string of BASE-10 chars, dirty hack but could do the trick.
    • Condense URL encoding w:BASE64 and 64 possible characters, exactly 6 bits from w:ISO/IEC 646 (hard compatible with 7-bit ASCII)
    • w:UTF-8 encoded w:Unicode for text. UTF-8 is efficient and allows 2^32 different characters.. unless escaped to 40-bit in future revision

    Languages

    Programming languages

    Currently used languages

    Considered languages / languages of interest

    Databases

    Current databases

    Main article: Database

    Currently using all major free software RDBMS:

    Considered databases


    Frameworks

    Frameworks


    Software

    Main article: Software

    Current software

    ...


    APIs

    APIs to communicate with:

    • Free social media ones

    Data sources

    Data sources: own

    • Semantic MediaWiki well thought out and set up on implementation wiki (c.consumerium.org)
    • Development Wiki - User database, userspace sharing or importing and Unified Login are being looked at as options.

    Data sources: Semantic web


    Lists of Sparql endpoints


    Media sources

    Copyleft: