A New Paper at ICDE 2019.

Our paper, Extending Fine-Grained Provenance to ETL Tasks, was accepted for publication at the 35th International Conference on Data Engineering 2019. Here is the abstract. Data provenance tools facilitate reproducibility by capturing the steps used to produce analyses. However, there are trade-offs among workflow provenance systems which allow arbitrary code and workflows but only track provenance […]

Our ProvCite paper was accepted at VLDB 2019!

Our paper, ProvCite: Provenance-based Data Citation, was accepted for publication at the 45th International Conference on Very Large Data Bases 2019. Here’s the abstract. A computational challenge associated with data citation is how to automatically generate citations to arbitrary queries against a structured dataset. Previous work has explored this problem in the context of conjunctive […]

Penn DB group at SIGMOD/PODS 2017

It’s a pleasure to share the accomplishments of two members of the Penn database group at the PODS 2017 conference here in Chicago. Prof. Susan Davidson gave a wonderful keynote talk about our work on data citation and provenance. Prof. Val Tannen received the ToT (Test of Time) award for his work on semiring provenance, “The Semiring Framework for Database […]

Automating data citation: the eagle-i experience

Great news! Our paper “Automating data citation: the eagle-i experience” has been accepted for inclusion in the JCDL 2017 conference. Abstract: Data citation is of growing concern for owners of curated databases, who wish to give credit to the contributors and curators responsible for portions of the dataset and enable the data retrieved by a […]

Automating Data Citation in CiteDB

Our paper “Automating Data Citation in CiteDB” will appear in PVLDB 2017 (Munich, Germany). Abstract: An increasing amount of information is being collected in structured, evolving, curated databases, driving the question of how information extracted from such datasets via queries should be cited. While several databases say how data should be cited for web-page views […]