-
Chapter and Conference Paper
\(\textbf{E}^{3}\)-MG: End-to-End Expert Linking via Multi-Granularity Representation Learning
Expert linking is the task of linking mentions to their corresponding experts in a knowledge base (KB). Previous works that focused on explicit features did not fully exploit the fine-grained linkage and pivot...
-
Chapter and Conference Paper
Wukong-CMNER: A Large-Scale Chinese Multimodal NER Dataset with Images Modality
So far, Multimodal Named Entity Recognition (MNER) has been performed almost exclusively on English corpora. Chinese phrases are not naturally segmented, making Chinese NER more challenging; nonetheless, Chine...
-
Article
CK-Modes Clustering Algorithm Based on Node Cohesion in Labeled Property Graph
The designation of the cluster number K and the initial centroids is essential for the K-modes clustering algorithm. However, most of the improved methods based on K-modes specify the K value manually and generate th...
-
Chapter and Conference Paper
Efficient Queries Evaluation on Block Independent Disjoint Probabilistic Databases
Probabilistic data management has recently drawn much attention from the database research community. This paper investigates safe plans of queries on block independent disjoint (BID) probabilistic databases. Th...
-
Chapter and Conference Paper
Reflection on the Popularity of MapReduce and Observation of Its Position in a Unified Big Data Platform
In recent years, MapReduce has risen to be the de facto tool for big data processing. MapReduce is a disruptive innovation. It has changed the landscape of the database market, the landscape of technologies, as wel...
-
Chapter and Conference Paper
Efficient Responsibility Analysis for Query Answers
Provenance information describes the origins and the history of data in its life cycle. Responsibility captures the notion of degree of causality and tells us which facts are the most influential in the lineag...
-
Chapter and Conference Paper
H-Tree: A Hybrid Structure for Confidence Computation in Probabilistic Databases
Probabilistic databases have become a popular tool for uncertain data management. Most work in the area focuses on efficient query processing and follows two main directions: accurate or approximate evaluation. I...
-
Article
Rule induction for uncertain data
Data uncertainty is common in real-world applications and can be caused by many factors, such as imprecise measurements, network latency, outdated sources and sampling errors. When mining knowledge from the...
-
Chapter and Conference Paper
Classify Uncertain Data with Decision Tree
This demo presents a decision tree based classification system for uncertain data. Decision tree is a commonly used data classification technique. Tree learning algorithms can generate decision tree models from a...
-
Chapter and Conference Paper
Cleaning Uncertain Streams for Query Improvement
Real-world applications confront uncertain streams derived from unreliable data acquisition equipment and/or defective processing algorithms. However, application context covers specific cleaning rules to bri...
-
Chapter and Conference Paper
Cleaning Uncertain Streams by Parallelized Probabilistic Graphical Models
Real-world applications generate uncertain streams due to unreliable equipment and/or data processing such as object identification. However, application context implies specific rules, which are critical in ...
-
Chapter and Conference Paper
DTU: A Decision Tree for Uncertain Data
Decision Tree is a widely used data classification technique. This paper proposes a decision tree based classification method on uncertain data. Data uncertainty is common in emerging applications, such as sen...
-
Chapter and Conference Paper
Characterizing DSS Workloads from the Processor Perspective
In this paper, we characterized the TPC-H benchmark on an Itanium II processor. Our experimental results clearly demonstrate: (1) On the Itanium II processor, the memory stall time is dominated by first level (L1) ...
-
Article
2DCMA: An Effective Maintenance Algorithm of Materialized Views in Peer Data Management Systems
Update management is very important for data integration systems, so update management in peer data management systems (PDMSs) is an active research area. This paper studies view maintenance in PDMSs. First,...
-
Chapter and Conference Paper
Materialized View Maintenance in Peer Data Management Systems
The problem of sharing data in peer data management systems (PDMSs) has received considerable attention in recent years. However, update management in PDMSs has received very little attention. This paper propo...
-
Article
A commit strategy for distributed real-time transaction
Ramamritham gives three common types of constraints for the execution history of concurrent transactions. This paper extends the constraints and gives a fourth type of constraint. Then the weak commit depend...
-
Article
A hybrid distributed optimistic concurrency control method for high-performance real-time transaction processing
The conventional lock scheme tends to suffer from a cascade of blockings, while the optimistic concurrency control (OCC) scheme may suffer from wasting resources. To overcome these problems, some researchers h...