Advanced Data Mining and Applications
5th International Conference, ADMA 2009, Bei**g, China, August 17-19, 2009. Proceedings
Article
In this paper we study the problem of searching the Web with online learning algorithms. We consider that Web documents can be represented by vectors of n boolean attributes. A search engine is viewed as a lea...
Article
In this paper we report our research on building WebSail, an intelligent web search engine that is able to perform real-time adaptive learning. WebSail learns from the user's relevance feedback, so that it is abl...
Article
Document categorization as a technique to improve the retrieval of useful documents has been extensively investigated. One important issue in a large-scale metasearch engine is to select text databases that ar...
Book and Conference Proceedings
5th International Conference, ADMA 2009, Bei**g, China, August 17-19, 2009. Proceedings
Chapter and Conference Paper
Collaborative Filter (CF) methods supply favorably personalized predictions relying on adequate data from users. But the ratings, of new users or about new items are not always available and CF can’t make a pr...
Chapter and Conference Paper
How to discover the suited telecommunications service more quickly is important for improving the user experience. In the field of semantic web service most of discovery algorithms pay attention to the precisi...
Chapter and Conference Paper
In the k-nearest neighbor (KNN) classifier, nearest neighbors involve only labeled data. That makes it inappropriate for the data set that includes very few labeled data. In this paper, we aim to solve the classi...
Chapter and Conference Paper
In this paper, we propose a method for identifying and ranking possible categories of any user query based on the meanings and common usages of the terms and phrases within the query. Our solution utilizes Wor...
Chapter and Conference Paper
Current search engines do not explicitly take different meanings and usages of user queries into consideration when they rank the search results. As a result, they tend to retrieve results that cover the most ...
Chapter and Conference Paper
In this paper, we propose a new image search method, called “panoramic image search”, and show its application to similar landscape discovery. In order to perform the “panoramic image search”, we introduce an ...
Article
In this paper, we explore heterogenous information networks in which each vertex represents one entity and the edges reflect linkage relationships. Heterogenous information networks contain vertices of several...
Article
Data glitches are errors in a dataset. They are complex entities that often span multiple attributes and records. When they co-occur in data, the presence of one type of glitch can hinder the detection of anot...
Article
In this paper, given a set of check-in data, we aim at discovering representative daily movement behavior of users in a city. For example, daily movement behavior on a weekday may show users moving from one to...
Article
Given multimillion-node graphs such as “who-follows-whom”, “patent-cites-patent”, “user-likes-page” and “actor/director-makes-movie” networks, how can we find unexpected behaviors? When companies operate on th...
Article
The problem of similarity learning is relevant to many data mining applications, such as recommender systems, classification, and retrieval. This problem is particularly challenging in the context of networks,...
Article
Providing top-k typical relevant keyword queries would benefit the users who cannot formulate appropriate queries to express their imprecise query intentions. By extracting the semantic relationships both between...
Article
Internet users may suffer the empty or too little answer problem when they post a strict query to the Web database. To address this problem, we develop a general framework to enable automatically query relaxat...
Article
SPARQL, the W3C standard for RDF query languages, has gained significant popularity in recent years. An increasing amount of effort is currently being exerted to improve the functionality and usability of SPAR...
Article
In order to facilitate the accesses of general users to knowledge graphs, an increasing effort is being exerted to construct graph-structured queries of given natural language questions. At the core of the con...
Article
The task of temporal slot filling (TSF) is to extract values of specific attributes for a given entity, called “facts”, as well as temporal tags of the facts, from text data. While existing work denoted the te...