Wikidata presentation at SemTechBiz Berlin 2012

KIT – University of the State of Baden-Württemberg and National Large-scale Research Center of the Helmholtz Association

Institut AIFB – Angewandte Informatik und Formale Beschreibungsverfahren

www.kit.edu

Wikidata

The next big thing for Wikipedia

SemTechBiz Berlin, February 2012

Denny Vrandečić

KIT Karlsruhe Institute of Technology / Wikimedia Deutschland

Denny Vrandečić2 07.02.2012 Wikidata

Imagine a world in which every single person is given free access to the sum of all human knowledge.

About 500 million views per day

Top 200 Website

12M+ media files

All free to use

20M+ articles

1B+ edits

280+ languages

Coverage by language

English, German, French, Dutch: 1 million+

40 languages: 100,000+

107 languages: 10,000+

But what about other languages?

English

French

Italian

Catalan

Greek

Russian

Chinese

What Wikipedia knows

Wikipedia has articles about…

… all cities

… their populations

… their mayors

So can I ask for a list of the world’s ten largest cities with a female mayor?

Let’s see what happens…

WIKIPEDIA’S ANSWER: LISTS

COMPUTERS ARE STUPID

What humans see

Berlin

... has a population of 3,490,445

... is located in Germany

... has mayor Klaus Wowereit

... has an area of 892 km²

What computers see

Berlin

3,490,445

Germany

892 km²

COMPUTERS DON’T MAKE CONNECTIONS

COMPUTERS NEED OUR HELP

Berlin

From Wikidata

Capital of Germany

Also known as: City of Berlin

Main page, Contents, Access the API, Random page, Donate to Wikidata

Interaction: Help, About Wikidata, Community portal, Recent changes

Languages: Català, Česky, Dansk, Deutsch, Eesti, Español, Esperanto, Français, Hrvatski, Italiano, O’zbek, Complete list

Continent: Europe [3 sources]
Country: Germany [2 sources]
Population: 3,490,445 [1 source]; 3,500,000 [2 sources]; [other values]
Calling code: 030 [2 sources]
Mayor: Klaus W| [0 sources]
Vehicle registration: B [1 source]
Area: 891.85 km² [2 sources]
Twin city: Los Angeles [3 sources]
[new fact]

Autocomplete suggestions for the Mayor field:
Klaus Wowereit, German politician
Klaus Wunderlich, German musician
Klaus Waldeck, Austrian musician and former lawyer
Klaus Wagner, German mathematician
Klaus Wagner, stalker of the British Royal Family

Berlin

From Wikidata

Hauptstadt von Deutschland (capital of Germany)

Auch bekannt als (also known as): Stadt Berlin

Hauptseite, Inhalt, API, Zufällige Seite, Spende an Wikidata

Interaktion: Hilfe, Über Wikidata, Benutzerportal, Letzte Änderungen

Sprachen: Català, Česky, Dansk, Eesti, English, Español, Esperanto, Français, Hrvatski, Italiano, O’zbek, Complete list

Kontinent (continent): Europa [3 sources]
Land (country): Deutschland [2 sources]
Einwohner (population): 3.490.445 [1 source]; 3.500.000 [2 sources]; [weitere Werte]
Telefonvorwahl (calling code): 030 [2 sources]
Bürgermeister (mayor): Klaus Wowereit [2 sources]
Amtliches Kennzeichen (vehicle registration): B [1 source]
Fläche (area): 891,85 km² [2 sources]
Partnerstadt (twin city): Los Angeles [3 sources]
[new fact]

Berlin Continent Europe. Berlin Country Germany. Berlin Population 3490445. Berlin Calling_code 030. Berlin Vehicle_registration B. Berlin Mayor Klaus_Wowereit. Berlin Twin_city Los_Angeles.
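
The statements above are triples of subject, property, and value. As a minimal sketch, assuming a plain list of tuples as the store (the `find` helper and the tuple layout are illustrative assumptions, not Wikidata's actual data model), this is how such triples let a program follow the connections:

```python
# Illustrative only: the triples from the slide as (subject, property, value) tuples.
TRIPLES = [
    ("Berlin", "Continent", "Europe"),
    ("Berlin", "Country", "Germany"),
    ("Berlin", "Population", 3490445),
    ("Berlin", "Calling_code", "030"),
    ("Berlin", "Vehicle_registration", "B"),
    ("Berlin", "Mayor", "Klaus_Wowereit"),
    ("Berlin", "Twin_city", "Los_Angeles"),
]


def find(triples, subject=None, prop=None, value=None):
    """Return all triples matching the pattern; None matches anything (hypothetical helper)."""
    return [
        (s, p, v)
        for (s, p, v) in triples
        if (subject is None or s == subject)
        and (prop is None or p == prop)
        and (value is None or v == value)
    ]


# "Who is the mayor of Berlin?" becomes a pattern with one open slot.
print(find(TRIPLES, subject="Berlin", prop="Mayor"))
```

Because every statement has the same shape, one generic lookup covers any property without parsing article text.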

Berlin --Mayor--> Klaus Wowereit

Wikidata

Provide a database of the world’s knowledge that anyone can edit

Collect references and quotes for millions of data items

Engage a sustainable community that collects data from everywhere in a machine-readable way

Increase the quality and lower the maintenance costs of Wikipedia and related projects

Deliver software and community best practices enabling others to engage in projects of data collection and provisioning

Extracts facts from Wikipedia infoboxes

Publishes them in RDF

Shows potential of machine-readable data

Secondary database

Sources for every fact

Reflect diversity
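
One way to picture these three points is a record in which a property can hold several competing values, each with its own list of sources. The sketch below is an assumption for illustration only: the class names, the `unsourced` helper, and the placeholder source labels are hypothetical, and the values and source counts mirror the Berlin mock-up from earlier in the talk.

```python
from dataclasses import dataclass, field


@dataclass
class Statement:
    prop: str
    value: object
    sources: list = field(default_factory=list)  # placeholder source labels


@dataclass
class Item:
    label: str
    statements: list = field(default_factory=list)

    def values(self, prop):
        """All recorded statements for a property; disagreeing values stay side by side."""
        return [s for s in self.statements if s.prop == prop]

    def unsourced(self):
        """Statements that still lack a source."""
        return [s for s in self.statements if not s.sources]


berlin = Item("Berlin", [
    Statement("Population", 3490445, ["source A"]),
    Statement("Population", 3500000, ["source B", "source C"]),
    Statement("Mayor", "Klaus Wowereit"),  # newly entered, no source yet
])

for s in berlin.values("Population"):
    print(s.value, len(s.sources))
```

Keeping both population figures, rather than picking one, is what "reflect diversity" would mean in such a structure; the source list is what makes each figure checkable.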

Project plan: 3 phases

Phase 1: Language links

Phase 2: Infobox augmentation

Phase 3: Inline queries

Phase 1: Language links

Current: every language links to every other

In Wikidata: create one page for each entity, list representations in each language

In Wikipedias: pull language links from Wikidata
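
To see why centralizing helps, consider the bookkeeping. If each of n language editions stores a link to every other edition, one topic needs n × (n − 1) links; with one central record, each edition is listed once. The sketch below uses hypothetical function names and a toy record (the titles are the ordinary Wikipedia article titles for Berlin in four editions); it is not how the Wikidata software is implemented.

```python
def pairwise_links(n_languages):
    """Links stored when every edition links to every other one."""
    return n_languages * (n_languages - 1)


def central_links(n_languages):
    """Entries stored when one central record lists each edition once."""
    return n_languages


# Toy central record: language code -> article title in that edition.
BERLIN_LINKS = {"en": "Berlin", "de": "Berlin", "it": "Berlino", "ru": "Берлин"}


def language_links(central_record, own_language):
    """What one edition would pull from the central record: every entry except its own."""
    return {
        lang: title
        for lang, title in central_record.items()
        if lang != own_language
    }


print(pairwise_links(280), central_links(280))  # 78120 280
print(language_links(BERLIN_LINKS, "de"))
```

With the 280 languages mentioned in the talk, a fully linked topic drops from 78,120 stored links to 280 central entries.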

Phase 2: Infobox augmentation

Current: each article calls an infobox with values

In Wikidata: centralize the values

In Wikipedias: just call the infobox and populate it with values from Wikidata
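
A rough sketch of this idea, assuming a hypothetical central store keyed by item and property plus per-language label tables. Nothing here reflects real MediaWiki template syntax or the Wikidata API; the values and German labels are those shown on the Berlin slides.

```python
# Hypothetical central store: item -> property -> value (values from the Berlin slides).
CENTRAL = {
    "Berlin": {"population": 3490445, "area_km2": 891.85, "calling_code": "030"},
}

# Each wiki keeps only its own labels for the properties its infobox shows.
LABELS = {
    "en": {"population": "Population", "area_km2": "Area (km²)", "calling_code": "Calling code"},
    "de": {"population": "Einwohner", "area_km2": "Fläche (km²)", "calling_code": "Telefonvorwahl"},
}


def render_infobox(item, language, properties):
    """Build infobox lines by looking values up centrally instead of storing them per article."""
    values = CENTRAL[item]
    labels = LABELS[language]
    return [f"{labels[p]}: {values[p]}" for p in properties]


print("\n".join(render_infobox("Berlin", "en", ["population", "area_km2"])))
print("\n".join(render_infobox("Berlin", "de", ["population", "calling_code"])))
```

A correction to the central value then shows up in every language's infobox, which is the maintenance saving the phase aims for.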

Phase 3: Inline queries

Enable inline queries in Wikipedias

With several formats
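
The question asked earlier in the talk, the ten largest cities with a female mayor, is the kind of query this phase targets. The sketch below only illustrates the logic (filter, sort, limit) and two possible output formats. The records are invented placeholders, not real cities or figures, and the function names are hypothetical; no actual query syntax is implied.

```python
# Placeholder records, invented purely for illustration (not real cities or figures).
CITIES = [
    {"name": "City A", "population": 900_000, "mayor_gender": "female"},
    {"name": "City B", "population": 2_500_000, "mayor_gender": "male"},
    {"name": "City C", "population": 1_200_000, "mayor_gender": "female"},
    {"name": "City D", "population": 300_000, "mayor_gender": "female"},
]


def largest_cities_with_female_mayor(cities, limit=10):
    """Keep cities whose mayor is female, sort by population (largest first), cut to limit."""
    matching = [c for c in cities if c["mayor_gender"] == "female"]
    matching.sort(key=lambda c: c["population"], reverse=True)
    return [c["name"] for c in matching[:limit]]


def as_bulleted_list(names):
    """One output format: a bulleted list, one name per line."""
    return "\n".join(f"* {name}" for name in names)


def as_comma_list(names):
    """Another output format: a single comma-separated line."""
    return ", ".join(names)


result = largest_cities_with_female_mayor(CITIES)
print(as_bulleted_list(result))
print(as_comma_list(result))
```

The same result set feeds both formatters, which is one plausible reading of "several formats": the query is written once and rendered as needed.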

Open source project

400+ users: NASA, Europeana, Deutsche Telekom, …

20+ languages

World-wide community

Commercial support

Many extensions

semantic-mediawiki.org

Conclusions

Editable, common resource for data

Enables much smaller contribution size

Freely reusable, machine-readable data

Able to answer questions

Available in 280+ languages

Imagine a world in which every single person is given free access to the sum of all human knowledge.

Thank you!

http://meta.wikipedia.org/wiki/Wikidata_WMDE

presenting work done by Markus Krötzsch, Yaron Koren, Daniel Kinzler, Qamarniso Ismoilova, Sergey Chernishev, Max Völkel, Heiko Haller, Sebastian Blohm, Philipp Sorg, Peter Haase, Than Tran, Basil Ell, Daniel Herzig, Benedikt Kämpgen, Elena Simperl, Delia Rusu, Marko Grobelnik, Michael Cariaso, Amélie Cordier, Jean Lieber, Emmanuel Nauer, Yannick Toussaint, Pascal Molli, Hala Skaf-Molli, Joel Natividad, Daniel Hansch and the Ontoprise team, and many others