Yu Xu
Department of Computer Science and Engineering
La Jolla, CA 92093-0114, USA
Phone: 858-699-8109
E-mail:
WWW: http://www.cs.ucsd.edu/~yxu
Research Interests
Query processing for XML, semi-structured, and
relational data; data integration; XML query languages (XQuery, XPath, XSLT),
storage, and indexing; information retrieval in semi-structured data; keyword
search in XML; databases and the Web; Web Services.
Education
Ph.D., Computer Science, University of California, San Diego, expected Summer 2005
M.S., Computer Science, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, June 1999
B.S.E., Computer Science, Northern Jiaotong University, Beijing, June 1996
Work Experience
IBM
AT&T Labs Research, Florham Park, NJ (June-August 2002)
As a summer intern, worked on the design and implementation of PIX, a system
that permits phrase matching in XML documents that contain “mixed content”. A
key feature of PIX is that users can specify which markup and annotations to
ignore when matching a phrase. The work was published in VLDB 2003, and the
system was demonstrated at SIGMOD 2003 and ICDE 2003. PIX uses inverted indices
and an efficient evaluation algorithm to compute the set of matches, and returns
answers in which phrases, ignored tags, and content are highlighted. In
addition, query answers are sorted using a ranking function. PIX is implemented
as an extension of GALAX, a full-fledged XQuery engine. The functionality of PIX
is fully integrated into XQuery and permits a natural combination of XPath-based
structure matching with phrase matching.
Enosys Software
Teaching Experience
· Winter 2002 Teaching Assistant for CSE 130 Programming Languages
· Fall 2001 Teaching Assistant for CSE 21 Math
· Spring 2001 Teaching Assistant for CSE 126 Multimedia Systems
· Spring 2000 Teaching Assistant for CSE 120 Operating Systems
· Winter 2000 Teaching Assistant for CSE 131B Compiler Construction II
Publications
Patents
Phrase matching in documents having nested-structure arbitrary (document-specific) markup
Online Demonstrations
XML Query Algebra: http://feast.ucsd.edu/People/yu/xmlqademo/index.htm
XML Keyword Search: http://feast.ucsd.edu/People/yu/xksearch/index.htm
Phrase Matching in XML: http://teriyaki.ucsd.edu:9099/pix/index.htm
Java, C++/C, XQuery, XSL, JavaCC, YACC, JFlex, Java Servlet, ML/OCaml, Prolog, Apache Tomcat
Community Services
International Conference on Data Engineering 2006
Program Committee member for "XML & Semi-structured Data"
Volunteer Webmaster for San Diego Chinese Association
References
Yannis Papakonstantinou, Associate Professor
Department of Computer Science and Engineering
La Jolla, CA 92093-0114, USA
Phone: (858) 822-1612
E-mail: yannis@cs.ucsd.edu
Alin Deutsch, Assistant Professor
Department of Computer Science and Engineering
La Jolla, CA 92093-0114, USA
Phone: (858) 822-2276
E-mail: deutsch@cs.ucsd.edu
Mary Fernandez, Principal Technical Staff Member
AT&T Labs – Research
Florham Park, NJ 07932-0971
Phone: (973) 360-8679
E-mail: mff@research.att.com
Divesh Srivastava, Head of the Database Research Department
AT&T Labs – Research
Florham Park, NJ 07932-0971
Phone: (973) 360-8776
E-mail: divesh@research.att.com
Don Chamberlin, IBM Fellow, ACM Fellow, and member of the National Academy of Engineering
E-mail: chamberlin@almaden.ibm.com