-
Chapter and Conference Paper
Machine Translation Systems: E-K, K-E, J-K, K-J
We present four kinds of machine translation system in this description: E-K (English to Korean), K-E (Korean to English), J-K (Japanese to Korean), K-J (Korean to Japanese). Among these, E-K and K-J translati...
-
Chapter and Conference Paper
Classification of the Risk Types of Human Papillomavirus by Decision Trees
The high-risk type of Human Papillomavirus (HPV) is the main etiologic factor of cervical cancer, which is a leading cause of cancer deaths in women worldwide. Therefore, classifying the risk type of HPVs is v...
-
Chapter and Conference Paper
Large Scale Unstructured Document Classification Using Unlabeled Data and Syntactic Information
Most document classification systems consider only the distribution of content words of the documents, ignoring the syntactic information underlying the documents though it is also an important factor. In this...
-
Chapter and Conference Paper
Automatic Webpage Classification Enhanced by Unlabeled Data
This paper describes a novel method for webpage classification that uses unlabeled data. The proposed method is based on a sequential learning of the classifiers which are trained on a small number of labeled ...
-
Article
Word Sense Disambiguation by Learning Decision Trees from Unlabeled Data
In this paper we describe a machine learning approach to word sense disambiguation that uses unlabeled data. Our method is based on selective sampling with committees of decision trees. The committee members a...
-
Chapter and Conference Paper
Part-of-Speech Tagging and PP Attachment Disambiguation Using a Boosted Maximum Entropy Model
We have proposed previously a boosted maximum entropy model to overcome three major problems in applying the maximum entropy models to text chunking [1]: (i) feature selection, (ii) high computational complexity,...
-
Chapter and Conference Paper
Genetic Mining of DNA Sequence Structures for Effective Classification of the Risk Types of Human Papillomavirus (HPV)
Human papillomavirus (HPV) is considered to be the most common sexually transmitted disease and the infection of HPV is known as the major factor for cervical cancer. There are more than 100 types in HPV and e...
-
Chapter and Conference Paper
PromSearch: A Hybrid Approach to Human Core-Promoter Prediction
This paper presents an effective core-promoter prediction system on human DNA sequence. The system, named PromSearch, employs a hybrid approach which combines search-by-content method and search-by-signal meth...
-
Chapter and Conference Paper
Constructing an Ontology Based on Terminology Processing
An ontology consists of a set and definition of concepts that presents the characteristics of a given domain and relationship between the elements. This paper proposes a semiautomatic method to construct a dom...
-
Chapter and Conference Paper
An Automatic Approach to Classify Web Documents Using a Domain Ontology
This paper suggests an automated method for document classification using an ontology, which expresses terminology information and vocabulary contained in Web documents by way of a hierarchical structure. Onto...
-
Chapter and Conference Paper
Automatic Word Spacing in Korean for Small Memory Devices
Automatic word spacing will be a very useful tool in a SMS (simple message service) , if it can be commercially served. However, the problems of implementing it in the devices such as mobile phones are small m...
-
Chapter and Conference Paper
Document Retrieval Using Semantic Relation in Domain Ontology
This paper proposes a semiautomatic method to build a domain ontology using the results of text analysis and applies it to a document retrieval system. In order to present usefulness for retrieving a document ...
-
Chapter and Conference Paper
Semantic Query Expansion Based on a Question Category Concept List
When confronted with a query, question answering systems endeavor to extract the most exact answers possible by determining the answer type that fits with the query and the key terms used in the query. However...
-
Chapter and Conference Paper
Clause Boundary Recognition Using Support Vector Machines
This paper proposes a method for Korean clause boundary recognition. Clause boundary identification can be regarded as a three-class classification task, and it can be converted into a two-phase binary classif...
-
Chapter and Conference Paper
Program Plagiarism Detection Using Parse Tree Kernels
Many existing plagiarism detection systems fail in detecting plagiarism when there are an abundant garbage in the copied programs. This is because they do not use the structural information efficiently. In thi...
-
Chapter and Conference Paper
Ontology-Based Automatic Classification of Web Pages
The use of ontology in order to provide a mechanism to enable machine reasoning has continuously increased during the last few years.This paper suggests an automated method for document classification using an...
-
Chapter and Conference Paper
Vehicle Color Classification Based on the Support Vector Machine Method
We present a vehicle color classification method from outdoor vehicle images. Although the vehicle color recognition is important especially for the newest applications including ITS (intelligent transportatio...
-
Chapter and Conference Paper
Dinucleotide Step Parameterization of Pre-miRNAs Using Multi-objective Evolutionary Algorithms
MicroRNAs (miRNAs) form a large functional family of small noncoding RNAs and play an important role as posttranscriptional regulators, by repressing the translation of mRNAs. Recently, the processing mechanis...
-
Chapter and Conference Paper
Dependency Analysis of Clauses Using Parse Tree Kernels
Identification of dependency relation among clauses is one of the most critical parts in parsing Korean sentences because it generates severe ambiguities. The resolution of the ambiguities involves both syntac...
-
Chapter and Conference Paper
Identification of Subject Shareness for Korean-English Machine Translation
One of the most critical issues in translating Korean into other languages is the common use of empty arguments. Since even mandatory elements in Korean are often dropped unlike English, the missing elements s...