We are just about done with an end-to-end workflow for reconstituting DBpedia 3.2 instances in the cloud, courtesy of Virtuoso and EC2.
Basically, this is how it works:
- Instantiate a Virtuoso EC2 AMI (paid variety)
- Install the special EC2 extensions VAD (ec2ext_dav.vad) via the Conductor UI or iSQL
- Restore the Virtuoso+DBpedia backup from our S3 bucket
- After approx. 1 hr, you will have a complete DBpedia replica in your own data space on the Linked Data Web.
DBpedia replica implies:
- SPARQL Endpoint
- Linked Data Viewer Pages (as you see in the public DBpedia instance)
- All requisite re-write rules for URI de-referencing and attribution (i.e., low-cost triples that link back to the main DBpedia using terms from our little Attribution Ontology)
- All the inference rules for the UMBEL, YAGO, OpenCyc, and DBpedia-OWL data dictionaries
- All Full Text Indexes
- All Bitmap Indexes.
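To make the first few items concrete, here is a minimal Python sketch of how you might poke at a freshly restored replica. It only builds the requests; nothing is sent until you ask. The host name, the `/sparql` path, the `/resource/` URI layout, and the `format` parameter are all assumptions modelled on the public DBpedia instance, so adjust them to whatever your own instance actually exposes. The helper names are purely illustrative.

```python
from urllib.parse import urlencode, urlparse, parse_qs
from urllib.request import Request


def sparql_url(host, query, fmt="application/sparql-results+json"):
    """Build a GET URL for the replica's SPARQL endpoint.

    Assumes the endpoint lives at /sparql on the given host and accepts
    a `format` parameter alongside the standard `query` parameter.
    """
    return f"http://{host}/sparql?" + urlencode({"query": query, "format": fmt})


def deref_request(host, local_name, accept="application/rdf+xml"):
    """Build a request that de-references an entity URI on the replica.

    Assumes entity URIs follow the /resource/<name> layout and that the
    re-write rules negotiate on the Accept header.
    """
    return Request(f"http://{host}/resource/{local_name}",
                   headers={"Accept": accept})


if __name__ == "__main__":
    # Hypothetical host name; substitute your EC2 instance's public DNS name.
    host = "my-dbpedia-replica.example.org"
    print(sparql_url(host, "SELECT * WHERE { ?s ?p ?o } LIMIT 5"))
    req = deref_request(host, "Berlin")
    print(req.full_url, req.get_header("Accept"))
    # To actually send either request, pass it to urllib.request.urlopen().
```

Keeping the request construction separate from the network call makes it easy to check what would be sent before pointing it at a live instance.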
Tomorrow is the official go-live day (due to last-minute price changes), but you can instantiate a paid Virtuoso AMI starting now :-)
To be continued...