8th International Conference on Database Systems for Advanced Applications (DASFAA 2003)
Conference Program

26 - 28 March, 2003, Kyoto, Japan

March 26 (Wed.)

9:30-9:35
Opening (Room-1, 5F)
Katsumi Tanaka, general chair
Masatoshi Yoshikawa, program co-chair
9:35-10:35
Session Chair: Sang Kyun Cha
Keynote: A Survey of New Directions in Database Systems (Room-1, 5F)
Jeffrey D. Ullman (Stanford University)

Abstract: As database system research evolves, there are several enduring themes. One, of course, is how we deal with the largest possible amounts of data. A less obvious theme is optimization; it is an essential ingredient of all modern forms of database system. Because we deal with large volumes of data, we are often forced to process that data in regular ways. But when operations are uniform, there is an opportunity for the use of very-high-level languages, of which SQL is the primary example. However, to make a very-high-level language effective, we need to optimize it well, that is, produce effective query plans from all sorts of queries.

In this talk, we shall review the principal directions in which modern database research is going, and in each case talk a bit about the optimization problems. Stream management systems are one very important new area. Another is peer-to-peer database systems. Integration of heterogeneous information, especially in virtual databases, is also a major challenge. XML, XQuery, and semistructured data in general form yet another research opportunity.

10:50-12:20

Session 1: Spatial Query Processing (Room-1, 5F)

Similarity Join for Low- and High-Dimensional Data
Dmitri V. Kalashnikov, Sunil Prabhakar
Spatial Query Processing for High Resolutions
Hans-Peter Kriegel, Martin Pfeifle, Marco Pötke, Thomas Seidl
Effective Similarity Search on Voxelized CAD Objects
Hans-Peter Kriegel, Peer Kröger, Zahi Mashael, Martin Pfeifle, Marco Pötke, Thomas Seidl

Session 2: Data Integrity and Security (Room-2, 4F)

Discovering Direct and Indirect Matches for Schema Elements
Li Xu, David W. Embley
SANGAM: A Transformation Modeling Framework
Kajal T. Claypool, Elke A. Rundensteiner
Securing Your Data in Agent-Based P2P Systems
Xiaolin Pang, Barbara Catania, Kian-Lee Tan
13:40-15:10

Session 3: Data Mining (Room-1, 5F)

Ascending Frequency Ordered Prefix-tree: Efficient Mining of Frequent Patterns
Guimei Liu, Hongjun Lu, Yabo Xu, Jeffrey Xu Yu
An Efficient Sliding Window Algorithm for Detection of Sequential Patterns
Lilian Harada
Caucus-based Transaction Clustering
Jinmei Xu, Sam Yuan Sung

Session 4: WWW and Multidatabases (Room-2, 4F)

TAX-PQ: Dynamic Taxonomy Probing and Query Modification for Topic-Focused Web Search
Said Mirza Pahlevi, Hiroyuki Kitagawa
Finding a Web Community by Maximum Flow Algorithm with HITS Score Based Capacity
Noriko Imafuji, Masaru Kitsuregawa
Scalable View Expansion in a Peer Mediator System
Timour Katchaounov, Vanja Josifovski, Tore Risch
15:30-17:00

Session 5: Text Mining and Knowledge Discovery (Room-1, 5F)

Mining Emerging Substrings
Sarah Chan, Ben Kao, C. L. Yip, Michael Tang
Fast Text Classification: A Training-Corpus Pruning Based Approach
Shuigeng Zhou, Tok Wang Ling, Jihong Guan, Jiangtao Hu, Aoying Zhou
Efficient Record Linkage in Large Data Sets
Liang Jin, Chen Li, Sharad Mehrotra

Session 6: Query Optimization (Room-2, 4F)

Maintenance of Partial-Sum-Based Histograms
Kin Fai Kan, David W. Cheung, Ben Kao
Selectivity Estimation Using Orthogonal Series
Feng Yan, Wen-Chi Hou, Qiang Zhu
Error Minimization for Approximate Computation of Range Aggregations
Xuemin Lin, Qing Zhang
17:30-18:30
Welcome Reception
Hall (2F), Kyoto Campus Plaza

March 27 (Thu.)

9:20-10:50
Session Chair: Hiroshi Ishikawa
Tutorial 1: Statistical Learning Methods for Emerging Database Applications (Room-1, 5F)
E. Chang (UC Santa Barbara)
11:10-12:10

Session 7: Moving Object Databases (Room-1, 5F)

Q+Rtree: Efficient Indexing for Moving Object Databases
Yuni Xia, Sunil Prabhakar
Efficient Index Update for Moving Objects with Future Trajectories
Rui Ding, Xiaofeng Meng, Yun Bai

Session 8: Caching (Room-2, 4F)

Prefetching for Visual Data Exploration
Punit R. Doshi, Elke A. Rundensteiner, Matthew O. Ward
Freshness-driven Adaptive Caching for Dynamic Content
Wen-Syan Li, Oliver Po, Wang-Pin Hsiung, K. Selçuk Candan, Divyakant Agrawal
13:40-15:10

Session 9: Data Warehouse (Room-1, 5F)

Time-Stratified Sampling for Approximate Answers to Aggregate Queries
João Pedro Costa, Pedro Furtado
The BofS Solution to Limitations of Approximate Summaries
Pedro Furtado, João Pedro Costa
An Efficient and Interactive A*-Algorithm with Pruning Power: Materialized View Selection Revisited
Gang Gou, Jeffrey Xu Yu, Chi-Hon Choi, Hongjun Lu

Session 10: XML (1) (Room-2, 4F)

Supporting Selection-projection XQuery Processing Based on Encoding Paths
Ya-Hui Chang, Cheng-Ta Lee
A Fast and Versatile Path Index for Querying Semi-Structured Data
Michael Barg, Raymond Wong
PathGuide: An Efficient Clustering Based Indexing Method for XML Path Expressions
Jiefeng Cheng, Ge Yu, Jeffrey Xu Yu, Guoren Wang
15:30-17:00
Panel Session: Next-Generation Web Technology and Database Issues (Room-1, 5F)

Abstract: This panel discusses next-generation Web technologies (the Semantic Web, Web conceptual modeling, Web services, stream data modeling, multimedia content processing, etc.) and the database issues they raise. In particular, the panel focuses on which database issues will become important in light of recent advances in Web technologies.

Panelists:
  • Prof. Dick Lun Lee
  • Prof. Tok Wang Ling
  • Prof. Stefano Spaccapietra
  • Prof. Kyu-Young Whang
18:30-20:30
Banquet
Rakusui (Japanese garden and Japanese dishes), near Nanzen temple (http://www.rakusui.com/homeframe.htm).

March 28 (Fri.)

9:20-10:50
Session Chair: Hiroshi Ishikawa
Tutorial 2: Unstructured Information Management (Room-1, 5F)
M. Mohania (IBM India Research Lab)
11:10-12:10

Session 11: Content-Based Data Retrieval (Room-1, 5F)

Indexing High-Dimensional Data for Content-Based Retrieval in Large Databases
Manuel J. Fonseca, Joaquim A. Jorge
iSearch: Mining Retrieval History for Content-Based Image Retrieval
Hongyu Wang, Beng Chin Ooi, Anthony K. H. Tung

Session 12: Stream Database and Filtering (Room-2, 4F)

M-Kernel Merging: Towards Density Estimation over Data Streams
Aoying Zhou, Zhiyuan Cai, Li Wei, Weining Qian
Composition of Filtering Functions
Rie Sawai, Masahiko Tsukamoto, Tsutomu Terada, Shojiro Nishio
13:40-15:40

Session 13: Bioinformatics and Database (Room-1, 5F)

Approximate String Matching in DNA Sequences
Lok-Lam Cheng, David W. Cheung, Siu-Ming Yiu
An Efficient Index-based Protein Structure Database Searching Method
Zeyar Aung, Wei Fu, Kian-Lee Tan
Integrating, Managing and Analyzing Protein Structures with XML Databases
William Shui, Raymond K. Wong, Stephen C. Graham, Lawrence Lee, W. Bret Church
A Clustering Method for Comparative Analysis between Genomes and Pathways
Shoko Miyake, Yukako Tohsato, Yoichi Takenaka, Hideo Matsuda

Session 14: XML (2) (Room-2, 4F)

Cost-Driven Storage Schema Selection for XML
Shihui Zheng, Ji-Rong Wen, Hongjun Lu
Performance Enhancement through Structural Redundancy in Mapping XML into Relations
Jaehoon Kim, Seog Park
Mining Frequent Query Patterns from XML Queries
Liang Huai Yang, Mong Li Lee, Wynne Hsu, Sumit Acharya
GLASS: A Graphical Query Language for Semi-Structured Data
Wei Ni, Tok Wang Ling