|
|
 |
|
Active Projects |
| |
-
Adaptive Data Cleaning for Business
Intelligence: The main goal of the project is the development of a full
logical language for expressing data quality constraints (DQCs).
Since data cleaning and data quality assessments are context dependent, the
specification of DQCs will be extended with logic-based specifications of
contexts. These contexts would provide (or constraint) the extensions of the
predicates in DQCs. Another goal is to learn from data cleaning activities,
with the purpose of deriving both specifications of DQCs and declarative
rules for data cleaning. This project is part of the
NSERC Business Intelligence Network.
-
Business-driven data integration: The
main goal of the project is to produce a new methodology for top-down
integration driven by a conceptual model. This conceptual integration
model would function as a semantic layer used by business users to
specify what info is needed and in what form; the system must satisfy the
user request in a (semi-) automatic fashion. This project is part of
the
NSERC Business Intelligence Network.
-
Papyrus: A
multinational European project for building a
cross-discipline digital library engine that draws content from one
domain and makes it available to a community of users who belong to a
totally different discipline. Some ontology management research issues in
this context
include modeling concept evolution and semantic updates, and support for
dynamic attributes (attributes for which domains and ranges are specified
declaratively).
|
|
Past Projects |
| |
-
DescribeX: A
framework for structural summaries developed as part of my PhD thesis work. DescribeX supports constructing heterogeneous
XML summaries that can be declaratively defined and manipulated by means of
path regular expressions on axes. DescribeX not only captures most previous summary proposals but also
provides a declarative way of defining entirely new ones. Some applications
include exploring the heterogeneous structure of large XML collections and adapting DescribeX summaries to
XPath query workloads.
-
Temporal XML:
A proposal for modeling and implementing temporal data in XML. It includes
algorithms for validating temporal XML documents against the temporal
constraints imposed by the model, as well as a
framework for summarizing metadata that adds the
time dimension to structural path summaries.
-
XPlainer: A
language for providing visual explanations of XPath queries, a kind of data
provenance of the answer.
XPlainer-Eclipse,
a tool that implements visual explanations using Java and Eclipse, can be
found
here.
-
ToX
(the Toronto XML Server): A
repository of XML data and metadata that provides the key functions in
document management, including registering documents, indexing document
structure (with ToXin),
defining logical views of distributed data sources, and querying document
content and structure.
|
| |
|