Search
Search Results
-
Clustering Via Decision Tree Construction
Clustering is an exploratory data analysis task. It aims to find the intrinsic structure of data by organizing data objects into similarity groups or... -
A New Theoretical Framework for K-Means-Type Clustering
One of the fundamental clustering problems is to assign n points into k clusters based on the minimal sum-of-squares(MSSC), which is known to be... -
The Mathematics of Learning: Dealing with Data *
Learning is key to develo** systems tailored to a broad range of data analysis and information extraction tasks. We outline the mathematical... -
Web Page Classification*
This chapter describes systems that automatically classify web pages into meaningful categories. It first defines two types of web page... -
Sequential Pattern Mining by Pattern-Growth: Principles and Extensions*
Sequential pattern mining is an important data mining problem with broad applications. However, it is also a challenging problem since the mining may... -
A Feature/Attribute Theory for Association Mining and Constructing the Complete Feature Set
A correct selection of features (attributes) is vital in data mining. For this aim, the complete set of features is constructed. Here are some... -
Incremental Mining on Association Rules
The discovery of association rules has been known to be useful in selective marketing, decision analysis, and business management. An important... -
Web Mining – Concepts, Applications and Research Directions
From its very beginning, the potential of extracting valuable knowledge from the Web has been quite evident. Web mining, i.e. the application of data... -
Mining Association Rules from Tabular Data Guided by Maximal Frequent Itemsets
We propose the use of maximal frequent itemsets (MFIs) to derive association rules from tabular datasets. We first present an efficient method to... -
Privacy-Preserving Data Mining
The growth of data mining has raised concerns among privacy advocates. Some of this is based on a misunderstanding of what data mining does. The... -
Logical Regression Analysis: From Mathematical Formulas to Linguistic Rules
Data mining means the discovery of knowledge from (a large amount of)data, and so data mining should provide not only predictions but also knowledge... -
An empirical study on cross-component dependent changes: A case study on the components of OpenStack
Modern software systems are composed of several loosely coupled components. Typical examples of such systems are plugin-based systems, microservices,...
-
When is it acceptable to break the rules? Knowledge representation of moral judgements based on empirical data
Constraining the actions of AI systems is one promising way to ensure that these systems behave in a way that is morally acceptable to humans. But...
-
Towards Cardinality-Aware Evidential Combination Rules in Dempster–Shafer Theory
The Dempster–Shafer theory has garnered significant attention for effectively managing uncertainty across various disciplines. However, the core...
-
Rethinking AI code generation: a one-shot correction approach based on user feedback
Code generation has become an integral feature of modern IDEs, gathering significant attention. Notable approaches like GitHub Copilot and TabNine...
-
HyperMatch: long-form text matching via hypergraph convolutional networks
Semantic text matching plays a vital role in diverse domains, such as information retrieval, question answering, and recommendation. However, longer...
-
Software verification challenges in the blockchain ecosystem
Blockchain technology has created a new software development context, with its own peculiarities, mainly due to the guarantees that the technology...
-
Progress on half a century of process modelling research in steelmaking: a review
Process modelling in steelmaking started from mid-sixties and witnessed rapid growth and wide spread applications during the last fifty years or so....
-
Improving laryngeal cancer detection using chaotic metaheuristics integration with squeeze-and-excitation resnet model
Laryngeal cancer (LC) represents a substantial world health problem, with diminished survival rates attributed to late-stage diagnoses. Correct...
-
An exploratory evaluation of code smell agglomerations
Code smell is a symptom of decisions about the system design or code that may degrade its modularity. For example, they may indicate inheritance...