OPQLPig: A Seamless Synergy between Pig and Provenance Query for Big Data

October 19, 2016
2:00 pm - 5:30 pm
Hall C

Track: General
Type: Posters
Level: All

We propose and design a framework for storing large provenance datasets in HDFS and querying them using OPQL, Pig and Hadoop. We extend OPQL, a graph-level provenance query language, to support W3C PROV-DM, standard provenance model; we propose algorithms to translate OPQL constructs to equivalent Pig Latin programs; and we develop and evaluate our OPQLPig solution on provenance datasets of UTPB.


, student, Wayne State University