Search Results
-
A New Theoretical Framework for K-Means-Type Clustering
One of the fundamental clustering problems is to assign n points into k clusters based on the minimal sum-of-squares (MSSC), which is known to be... -
The Mathematics of Learning: Dealing with Data *
Learning is key to developing systems tailored to a broad range of data analysis and information extraction tasks. We outline the mathematical... -
Web Page Classification*
This chapter describes systems that automatically classify web pages into meaningful categories. It first defines two types of web page... -
Clustering Via Decision Tree Construction
Clustering is an exploratory data analysis task. It aims to find the intrinsic structure of data by organizing data objects into similarity groups or... -
Sequential Pattern Mining by Pattern-Growth: Principles and Extensions*
Sequential pattern mining is an important data mining problem with broad applications. However, it is also a challenging problem since the mining may... -
Incremental Mining on Association Rules
The discovery of association rules has been known to be useful in selective marketing, decision analysis, and business management. An important... -
A Feature/Attribute Theory for Association Mining and Constructing the Complete Feature Set
A correct selection of features (attributes) is vital in data mining. For this aim, the complete set of features is constructed. Here are some... -
Web Mining – Concepts, Applications and Research Directions
From its very beginning, the potential of extracting valuable knowledge from the Web has been quite evident. Web mining, i.e. the application of data... -
Mining Association Rules from Tabular Data Guided by Maximal Frequent Itemsets
We propose the use of maximal frequent itemsets (MFIs) to derive association rules from tabular datasets. We first present an efficient method to... -
Logical Regression Analysis: From Mathematical Formulas to Linguistic Rules
Data mining means the discovery of knowledge from (a large amount of) data, and so data mining should provide not only predictions but also knowledge... -
Privacy-Preserving Data Mining
The growth of data mining has raised concerns among privacy advocates. Some of this is based on a misunderstanding of what data mining does. The... -
Interaction Design Patterns of Web Chatbots
Chatbots are often used in the web as an additional user interface which offers different modalities for users. Still, there is not yet an... -
Subjectivity, Polarity and the Aspect of Time in the Evolution of Crowd-Sourced Biographies
This study examines the use of subjective and sentimentally charged language in crowd-sourced articles by focusing on time and how articles in... -
Inclusive Counterfactual Generation: Leveraging LLMs in Identifying Online Hate
Counterfactually augmented data has recently been proposed as a successful solution for socially situated NLP tasks such as hate speech detection.... -
MatchCom: Stable Matching-Based Software Services Composition in Cloud Computing Environments
User preferences on throughput, latency, cost, service location, etc. indicate specific requirements when choosing a web service from the cloud... -
AuthApp – Portable, Reusable Solid App for GDPR-Compliant Access Granting
The Solid (Social Linked Data) technology family was developed to provide the foundation for Data Sovereignty in the context of web applications. The... -
Streamlining Vocabulary Conversion to SKOS: A YAML-Based Approach to Facilitate Participation in the Semantic Web
Controlled vocabularies, such as classification schemes, glossaries, taxonomies, or thesauri, play an important role in many Web services. One of the... -
AutoMaster: Differentiable Graph Neural Network Architecture Search for Collaborative Filtering Recommendation
Graph Neural Networks (GNNs) have been widely applied in Collaborative Filtering (CF) and have demonstrated powerful capabilities in recommender... -
Investigating the Usefulness of Product Reviews Through Bipolar Argumentation Frameworks
The importance of useful product reviews cannot be overstated, as they not only provide crucial information to potential buyers but also offer... -
GitHub-Sourced Web API Evolution: A Large-Scale OpenAPI Dataset
This study presents a dataset curated using a software tool called the crawler, which gathers OpenAPI Specifications (OAS) from GitHub. The crawler...