IEEE Symposium on Computational Intelligence and Data Mining (CIDM 2007)

Table of Contents

Link Analysis of Incomplete Relationship Networks...1

Edward Harrington

Validity of Probabilistic Rules...6

Marina Sapir and Mikhail Teverovskiy

An Efficient Distance Calculation Method for Uncertain Objects...10

Lurong Xiao and Edward Hung

K2GA: Heuristically Guided Evolution of Bayesian Network Structures from Data...18

Eli Faulkner

Extracting Borderline Associations...26

Wei Kian Chen, Dustin Baumgartner and Ryan Millikin

Selecting the Right Peer Schools for AACSB Accreditation - A Data Mining Application...31

Melody Kiang, Fisher Dorothy, Fisher Steve and Chi Robert

Structure Prediction in Temporal Networks using Frequent Subgraphs...35

Mayank Lahiri and Tanya Berger-Wolf

An Analytical Evaluation of Objective Measures Behavior for Generalized Association Rules...43

Veronica Carvalho, Solange Rezende and Mario Castro

Toward Versatile and Efficient Meta-Learning: Knowledge Representation and Management in Computational Intelligence...51

Krzysztof Grabczewski and Norbert Jankowski

Query-sensitive Feature Selection for Lazy Learners...59

Xin Tong and Mingyang Gu

Comparison of Classifiers Efficiency on Missing Values Recovering: Application in a Marketing Database with Massive Missing Data...66

Luis Zarate, Bruno Nogueira and Tadeu Santos

Manifold Learning using Growing Locally Linear Embedding...73

Junsong Yin, Dewen Hu and Zongtan Zhou

A Novel Complex-Valued Counterpropagation Network...81

Kanishka Tyagi, Deepak Mishra and Prem Kumar Kalra

A Prototype-driven Framework for Change Detection in Data Stream Classification...88

Hamed Valizadegan and Pang-Ning Tan

Evolutionary Optimization of Three-Photon Absorption in Molecular Iodine...96

Robert Burbidge, Nicholas Form, Jem Rowland, Benjamin Whitaker and Ross King

Induction tree to classify M. tuberculosis spoligotypes...101

Georges Valetudie

Data Clustering and Fuzzy Neural Network for Sales Forecasting in Printed Circuit Board Industry...107

Pei-Chann Chang, Chen-Hao Liu, Chin-Yuan Fan and Hsiao-Ching Chang

Fuzzy Wavelet Modeling Using Data Clustering...114

Nasser Sadati and Bahram Marami

SSM : A Frequent Sequential Data Stream Patterns Miner...120

Christie Ezeife and Mostafa Monwar

0-SM: A fast algorithm for mining Candidate Clusters in Pattern-based Clustering...127

Guo Jingfeng, Ma Qian and Liu Hanfeng

Cluster Detection with the PYRAMID Algorithm...133

Samir Tout, William Sverdlik and Sun Junping

Resource-aware Online Data Mining in Wireless Sensor Networks...139

Nhan Duc Phung, Mohamed Medhat Gaber and Uwe Roehm

Navigation Space Based Intranet Usability Analysis...147

Peter Geczy, Izumi Noriaki, Akaho Shotaro and Hasida Koiti

Algorithms for Fast Large Scale Data Mining Using Logistic Regression...155

Omid Rouhani-Kalleh

Influence of a priori Knowledge on Medical Document Categorization...163

Lukasz Itert, Wlodzislaw Duch and John Pestian

Evaluating Protein Motif Significance Measures: A case study on Prosite Patterns...171

Pedro G. Ferreira and Paulo J. Azevedo

Exploiting Semantic Descriptions of Products and User Profiles for Recommender Systems...179

Pingfeng Liu, Guihua Nie and Donglin Chen

Search Result Refinement via Machine Learning from Labeled-Unlabeled Data for Meta-search...186

Burak Ozyurt and Greg Brown

Mining the Students' Learning Interest in Browsing Web-Streaming Lectures...194

Long Wang and Christoph Meinel

Detection of Unknown Computer Worms Activity Based on Computer Behavior using Data Mining...202

Robert Moskovitch, Ido Gus, Shay Pluderman, Dima Stopel and Clint Feher

Identifying Anatomical Phrases in Clinical Reports by Shallow Semantic Parsing Methods...210

Vijayaraghavan Bashyam and Ricky Taira

Finding Similarity Relations in Presence of Taxonomic Relations in Ontology Learning Systems...215

Alireza Vazifedoost, Farhad Oroumchian and Masoud Rahgozar

SmartPortal for Biomedical Data Mining...221

Anna L. Buczak, Charles Wan and Glenn Petry

A Novel Image Semantic Block Clustering Method based on Artificial Visual Cortical Responding Model...228

Zhiping Xu, Shiyong Zhang and Shengxiang Ma

On Obtaining Fuzzy Rule Base from Ensemble of Takagi-Sugeno Systems...234

Korytkowski Marcin, Rutkowski Leszek, Scherer Rafal and Drozda Grzegorz

Using concept structures for efficient document comparison and location...238

Andrew Edmonds

Reactive Query Policies: A Formalism for Planning with Volatile External Information...243

Tsz-Chiu Au and Dana Nau

Using Data Mining to Enhance Automated Planning and Scheduling...251

Jeremy Frank

Hazard Situation Prediction Using Spatially and Temporally Distributed Vehicle Sensor Information...261

Thomas Schoen, Bernhard Sick and Markus Strassberger

An Efficient Data Management Scheme Based on Spatial and Temporal Characteristics in Virtual Environments...269

Hsing-Jen Chen and Damon Shing-Min Liu

Data Mining based Query Processing using Rough Sets and Genetic Algorithms...275

Srinivasa Kg, Jagadish M, Venugopal Kr and Patnaik Lalit M

A Quantitative Method for Analyzing Scan Path Data Obtained by Eye tracker...283

Haruhiko Takeuchi and Yoshiko Habuchi

Gaussian Process Latent Variable Models for Fault Detection...287

Luka Eciolaza, Muhammad Alkarouri, Neil D. Lawrence, Visakan Kadirkamanathan and Peter J. Fleming

Measuring the Validity of Document Relations Discovered from Frequent Itemset Mining...293

Kritsada Sriphaew and Thanaruk Theeramunkong

One-shot Collaborative Filtering...300

Shuhei Kuwata and Naonori Ueda

Efficient Kernel-based Learning for Trees...308

Fabio Aiolli, Giovanni Da San Martino, Alessandro Sperduti and Alessandro Moschitti

Knowledge Based Stacking of Hyperspectral Data for Land Cover Classification...316

Yangchi Chen, Melba Crawford and Joydeep Ghosh

A Classifier Capable of Handling New Attributes...323

Dong-Hun Seo, Chi-Hwa Song and Won Don Lee

Fuzzy c-Means Classifier for Relational Data...328

Hidetomo Ichihashi, Katsuhiro Honda, Yasuhiro Kuramoto and Fumiaki Matsuura

Mining Subspace Correlations...335

Rave Harpaz and Robert Haralick

Selective Approach To Handling Topic Oriented Tasks On The World Wide Web...343

Amit Awekar and Jaewoo Kang

A Constraint-based Method for Semantic Mapping from Natural Language Questions to OWL...349

Mingxia Gao, Jiming Liu, Ning Zhong and Furong Chen

iScore: Measuring the Interestingness of Articles in a Limited User Environment...354

Raymond Pon, Alfonso Cardenas, David Buttler and Terence Critchlow

Prediction of Abnormal Behaviors for Intelligent Video Surveillance Systems...362

Duarte Duque, Henrique Santos and Paulo Cortez

A Data Mining Method Applied to a Metallurgical Process...368

Henrik Saxen and Frank Pettersson

Distributed Document Clustering Using Word-clusters...376

Debzani Deb and Rafal Angryk

Session recognition and Bandwidth Guarantee for Encrypted Internet Voice Traffic: Case Study of Skype...384

Wang Jian-Hong, Pan J.-Y. and Cheng Yi-Chi

Classification of XML Documents...390

Abdelhamid Bouchachia and Bouchachia Hassler

Detection and Classification of Cardiac Murmurs using Segmentation Techniques and Artificial Neural Networks...397

Spencer Strunic, Fernando Rios-Gutierrez, Rocio Alba-Flores, Glenn Nordehn and Stanley Burns

Data Mining an EEG Dataset With an Emphasis on Dimensionality Reduction...405

Pari Jahankhani, Kenneth Revett and Vassillis Kodogiannis

Example-based Estimation of Hand's Strength in the Game of Bridge with or without Using Explicit Human Knowledge...413

Jacek Mandziuk and Krzysztof Mossakowski

Collaborative Knowledge Discovery and Data Mining: From Knowledge to Experience...421

Timo Horeis and Bernhard Sick

Using Hypergraph-based Clustering Scheme for Traversal Prediction in Virtual Environments...429

Hung Shao-Shin and Liu Damon Shing-Min

Mining Maximal Embedded Unordered Tree Patterns...437

Mostafa Haghir Chehreghani, Masoud Rahgozar, Caro Lucas and Morteza Haghir Chehreghani

Extraction of Non-hierarchical Relations from Domain Texts...444

Janardhana Punuru and Jianhua Chen

Clustering Rooted Ordered Trees...450

Mostafa Haghir Chehreghani, Masoud Rahgozar, Caro Lucas and Morteza Haghir Chehreghani

Scalable Clustering for Large High-Dimensional Data Based on Data Summarization...456

Ying Lai, Ratko Orlandic, Wai Gen Yee and Sachin Kulkarni

A Dynamic Graph Model for Analyzing Streaming News Documents...462

Elizabeth Hohman and David Marchette

Establishing User Profiles in the MediaScout Recommender System...470

Lior Rokach and Lihi Naamani

On Mining Dynamic Web ClickStreams for Frequent Traversal Sequences...477

Ren Jiadong and Zhang Xiaojian

More Efficient Classification of Web Content Using Graph Sampling...485

Chris Bennett

Handwritten Digit Recognition - Road to Contest Victory...491

Norbert Jankowski and Krzysztof Grabczewski

Evolutionary Neural Networks Applied to Land-cover Classification in Zhaoyuan, China...499

Yan Guo, Lishan Kang, Fujiang Liu, Huashan Sun and Linlu Mei

Incremental Local Outlier Detection for Data Streams...504

Dragoljub Pokrajac, Aleksandar Lazarevic and Longin Jan Latecki

Adaptive Frequency Counting over Bursty Data Streams...516

Bill Lin, Wai-Shing Ho, Ben Kao and Chun-Kit Chui

From similarity retrieval to cluster analysis: The case of R* trees...524

Jiaxiong Pi, Yong Shi and Zhengxin Chen (Corresponding author)

GAIS: A Method for Detecting Interleaved Sequential Patterns from Imperfect Data...530

Marja Ruotsalainen, Timo Ala-Kleemola and Ari Visa

A Modified Genetic Programming for Behavior Scoring Problem...535

Qing-Shan Chen, De-Fu Zhang, Li-Jun Wei and Huo-Wang Chen

Model Selection for Anomaly Detection in Wireless Ad Hoc Networks...540

Hongmei Deng and Roger Xu

A Dynamic Programming Algorithm for Name Matching...547

Philip Top, Farid Dowla and Jim Gansemer

MSP-CACRR: Multidimensional Sequential Patterns Based Call Admission Control and Resource Reservation for Next-Generation Wireless Cellular Networks...552

Sherif Rashad, Mehmed Kantardzic and Anup Kumar

Matching Random Tree Models of Spatio-Temporal Patterns to Tables or Graphs...560

David Paglieroni and Faranak Nekoogar

UNI3 - Efficient Algorithm for Mining Unordered Induced Subtrees Using TMG Candidate Generation...568

Hadzic Fedja, Tan Henry and Dillon Tharam

A Visual Approach for External Cluster Validation...576

Ke-Bing Zhang, Mehmet Orgun and Kang Zhang

Post-supervised Fuzzy c-Means Classifier with Hard Clustering...583

Hidetomo Ichihashi, Katsuhiro Honda, Naho Kuwamoto and Takao Hattori

Feature Selection for Change Detection in Multivariate Time-Series...590

Michael Botsch and Josef Nossek

Using the Pre-FUFP Algorithm for Handling New Transactions in Incremental Mining...598

Chun-Wei Lin, Tzung-Pei Hong and Wen-Hsiang Lu

Time Series Forecasting Using Multiple Gaussian Process Prior Model...604

Tomohiro Hachino and Visakan Kadirkamanathan

Mining Association Rules in Temporal Sequences...610

Khellaf Bouandas and Aomar Osmani

A New Evolutionary Approach for Time Series Forecasting...616

Tiago A. E. Ferreira, Germano C. Vasconcelos and Paulo J. L. Adeodato

Association Rule Discovery Has the Ability to Model Complex Genetic Effects...624

William Bush, Tricia Thornton-Wells and Marylyn Ritchie

Quantifying Privacy for Privacy Preserving Data Mining...630

Justin Zhan

Using Homomorphic Encryption For Privacy-Preserving Collaborative Decision Tree Classification...637

Justin Zhan

Privacy Preserving Burst Detection of Distributed Time Series Data Using Linear Transforms...646

Lisa Singh and Mehmet Sayal

Efficient Privacy-Preserving Association Rule Mining: P4P Style...654

Yitao Duan, John Canny and Justin Zhan

PCGEN: A Practical Approach to Projected Clustering and its Application to Gene Expression Data...661

Bouguessa Mohamed and Wang Shengrui

Classification of Biomedical Spectra Using Fuzzy Interquartile Encoding and Stochastic Feature Selection...668

Nick Pizzi, Mark Alexiuk and Witold Pedrycz

Semi-supervised Learning of the Hidden Vector State Model for Protein-Protein Interactions Extraction...674

Deyu Zhou, Yulan He and Chee Keong Kwoh

Designing a Fuzzy Rule Based System to Estimate Depth of Anesthesia...681

Vahid Esmaeili, Amin Assareh, Mohammad Bagher Shamsollahi, Mohammad Hasan Moradi and Noor Mohammad Arefian

Discovery of Temporal Dependencies between Frequent Patterns in Multivariate Time Series...688

Giridhar Tatavarty, Raj Bhatnagar and Barrington Young

Noise Reduction Approach for Decision Tree Construction: A Case Study of Knowledge Discovery on Climate and Air Pollution...697

Kyoko Fukuda

Application of Neural Networks for Data Modeling of Power Systems with Time Varying Nonlinear Loads...705

Joy Mazumdar, Ganesh Venayagamoorthy and Ronald Harley

Data Mining of MISR Aerosol Product using Spatial Statistics...712

Tao Shi and Noel Cressie

ADMIRAL: A Data Mining Based Financial Trading System...720

Gil Rachlin, Mark Last, Dima Alberg and Abraham Kandel

A Self-Impact Analysis by Artificial Market Simulation...726

Kiyoshi Izumi, Hiroki Matsui and Yutaka Matsuo

Spatial Data Mining for Optimized Selection of Facility Locations in Field-based Services...734

Ashkan Zarnani, Masoud Rahgozar, Caro Lucas and Fattaneh Taghiyareh

Mining time series data via linguistic summaries of trends by using a modified Sugeno integral based aggregation...742

Janusz Kacprzyk, Anna Wilbik and Slawomir Zadrozny

Analysis of Streaming GPS Measurements of Surface Displacement Through a Web Services Environment...750

Robert Granat, Galip Aydin, Marlon Pierce, Zhigang Qi and Yehuda Bock

Exchange Rates Forecasting Using a Hybrid Fuzzy and Neural Network Model...758

An-Pin Chen and Hsio-Yi Lin

Design of Multi-Singing Karaoke System and its Application in Customer Finance-aided Service on Internet...764

Jian-Hong Wang, Shih-Chuan Feng and J.-Y. Pan