Difference between revisions of "Lab: RDF programming with RDFlib"

From Info216
Line 8: Line 8:
import [https://rdflib.readthedocs.io/en/stable/py-modindex.html rdflib]:
import [https://rdflib.readthedocs.io/en/stable/py-modindex.html rdflib]:
*Graph (add, remove, serialize)
*Graph (add, remove, serialize, parse)

Revision as of 17:01, 23 January 2020

Lab 2: RDF programming with RDFlib


  • Basic RDF graph programming with RDFlib.
  • Simple reading/writing from/to file.


import rdflib:

  • Graph (add, remove, serialize, parse)
  • URIRef
  • Literal
  • NameSpace
  • Bnode


Consider the following situation: "Cade lives in 1516 Henry Street, Berkeley, California 94709, USA. He has a B.Sc. in biology from the University of California, Berkeley from 2011. His interests include birds, ecology, the environment, photography and travelling. He has visited Canada and France. Emma Dominguez lives in Carrer de la Guardia Civil 20, 46020 Valencia, Spain. She has a M.Sc. in chemistry from the University of Valencia from 2015. Her areas of expertise include waste management, toxic waste, air pollution. Her interests include bike riding, music and travelling. She has visited Portugal, Italy, France, Germany, Denmark and Sweden. Cade knows Emma. They met in Paris in August 2014." (Make up your own URIs when you need to (like "http://example.org/"), or even better: use terms you know from vocabularies such as FOAF and RDF.

Create a graph in RDFlib with triples corresponding to the text above.

Write out your graph to the console in the following formats: TURTLE, N-TRIPLE, N3, JSON-LD, RDF/XML. How do they differ? What is the default?

Write your graph to a file.

Edit the file so that Cade has the middle name Creighton and the middle name of Emma is María.

Create a new program that reads your graph in again from the file and writes it to the console. Check that the new middle names are there!

Continuing with either your first or second program, write a loop that goes through all the triples in the graph and prints them to the console.

Change the loop so that (a) it only loops through triples about Emma (b) it only loops through triples involving the names of people.

Remove all addresses from the graph.

If you have more time...

Below are four lines of comma-separated values (csv - five lines with the headers) that could have been saved from a spreadsheet. Copy them into a file and write a program with a loop that reads each line from that file (except the initial header line) and adds it to your graph as triples:

 "Regina Catherine Hall","F","Great Britain","Manchester","Ecology, zoology","Football, music travelling"
 "Achille Blaise","M","France","Nancy","","Chess, computer games"
 "Nyarai Awotwi Ihejirika","F","Kenya","Nairobi","Computers, semantic networks","Hiking, botany"
 "Xun He Zhang","M","China","Chengdu","Internet, mathematics, logistics","Dancing, music, trombone"

In the resulting graph, delete all information about Achille.

Have you used all the classes/interfaces listed at the beginning in your code, and all the variants of all the methods? If not, try to change your code to try them all!