A Web Information Retrieval System

Kim, Tae-Hyun; Park, Dong-Chul; Huh, Woong; Kim, Hyen-Ug; Yoon, Chung-Hwa; Park, Chong-Dae; Woo, Dong-Min; Jeong, Taikyeong; Cho, Il-Hwan; Lee, Yunsik

doi:10.1007/978-3-642-18134-4_81

Tae-Hyun Kim²,
Dong-Chul Park²,
Woong Huh²,
Hyen-Ug Kim²,
Chung-Hwa Yoon²,
Chong-Dae Park²,
Dong-Min Woo²,
Taikyeong Jeong²,
Il-Hwan Cho² &
…
Yunsik Lee³

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 135))

Included in the following conference series:

International Conference on Intelligent Computing and Information Science

1141 Accesses

Abstract

An approach for the retrieval of price information from internet sites is applied to real-world application problems in this paper. The Web Information Retrieval System (WIRS) utilizes Hidden Markov Model (HMM) for its powerful capability to process temporal information. HMM is an extremely flexible tool and has been successfully applied to a wide variety of stochastic modeling tasks. In order to compare the prices and features of products from various web sites, the WIRS extracts prices and descriptions of various products within web pages. The WIRS is evaluated with real-world problems and compared with a conventional method and the result is reported in this paper.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Chapter: EUR 29.95; Price includes VAT (Germany)

eBook: EUR 117.69; Price includes VAT (Germany)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Product Information Retrieval on the Web: An Empirical Study

Web Information Retrieval and Search

Semistructured Data Search

References

Chorbani, A.A., Xu, X.: A fuzzy markov model approach for predicting user navigation, pp. 307–311 (2007)
Google Scholar
Godoy, D., Amandi, A.: Learning browsing patterns for context-aware recommendation. In: Proc. of IFIP AI, pp. 61–70 (2006)
Google Scholar
Bayir, M.A., et al.: Smart Miner: A New Framework for Mining Large Scale Web Usage Data. In: Proc. of Int. WWW Conf., pp. 161–170 (2009)
Google Scholar
Cao, H., et al.: Towards Context-Aware Search by Learning A Very Large Variable Length Hidden Markov Model from Search Logs. In: Proc. of Int. WWW Conf., pp. 191–200 (2009)
Google Scholar
Brin, S., Page, L.: The Anatomy of a Large-Scale HypertextualWeb Search Engine. In: Proc. of Int. WWW Conf., pp. 107–117 (1998)
Google Scholar
Kleinberg, J.M.: Authoritative Sources in a Hyperlinked Environment. Journal of the ACM 46(5), 604–632 (1999)
Article MathSciNet MATH Google Scholar
Tomlin, J.A.: A New Paradigm for Ranking Pages on the World Wide Web. In: Proc. of. Int. WWW Conf., pp. 350–355 (2003)
Google Scholar
Rilo, E., Jones, R.: Learning Dictionaries for Information Extraction by Multi-Level Bootstrap**. In: Proc. of the 16th National Conf. on Articial Intelligence, pp. 811–816 (1999)
Google Scholar
Sonderland, S.: Learning information extraction rules for semi-structured and free text. Machine Learning 34(1), 233–272 (1999)
Article Google Scholar
Leek, T.R.: Information Extraction Using Hidden Markov Models. Master thesis, UC, San Diego (1997)
Google Scholar
Rabiner, L.R.: A tutorial on hidden Markov models and selected applications in speech recognition. Proc. of IEEE 77(2), 257–286 (1989)
Article Google Scholar
Bing, L., Robert, G., Yanhong, Z.: Mining data records in web pages. In: Proc. of ACM SIGKDD, pp. 601–606 (2003)
Google Scholar
Buttler, D., Liu, L., Pu, C.: A fully automated object extraction system for the world wide web. In: Proc.of IEEE ICDCS, pp. 361–370 (2001)
Google Scholar
Chang, C., Lui, S.: IEPAD: Information extraction based on Pattern Discovery. In: Proc. of WWW Conf., pp. 682–688 (2001)
Google Scholar
Park, D.-C., Kwon, O., Chung, J.: Centroid neural network with a divergence measure for GPDF data clustering. IEEE Trans. Neural Networks 19(6), 948–957 (2008)
Article Google Scholar
Jiang, J.: Modeling Syntactic Structures of Topics with a Nested HMM-LDA. In: Proc. of ICDM, pp. 824–829 (2009)
Google Scholar
Park, D.-C., Huong, V.T.L., Woo, D.-M., Hieu, D., Ninh, S.: Information Extraction System Based on Hidden Markov Model. In: Yu, W., He, H., Zhang, N. (eds.) ISNN 2009. LNCS, vol. 5551, pp. 55–59. Springer, Heidelberg (2009)
Google Scholar
Raghavan, V.V., Wang, G.S., Bollmann, P.: A Critical Investigation of Recall and Precision as Measures of Retrieval System Performance. ACM Trans. Info. Sys. 7(3), 205–229 (1989)
Article Google Scholar
http://www.cs.uic.edu/~liub/WebDataExtraction/MDR-download.html

Download references

Author information

Authors and Affiliations

Dept. of Electronics Engineering, Myong Ji University, Korea
Tae-Hyun Kim, Dong-Chul Park, Woong Huh, Hyen-Ug Kim, Chung-Hwa Yoon, Chong-Dae Park, Dong-Min Woo, Taikyeong Jeong & Il-Hwan Cho
System IC R&D Division, Korea Electronics Tech. Inst., Songnam, Korea
Yunsik Lee

Authors

Tae-Hyun Kim
View author publications
You can also search for this author in PubMed Google Scholar
Dong-Chul Park
View author publications
You can also search for this author in PubMed Google Scholar
Woong Huh
View author publications
You can also search for this author in PubMed Google Scholar
Hyen-Ug Kim
View author publications
You can also search for this author in PubMed Google Scholar
Chung-Hwa Yoon
View author publications
You can also search for this author in PubMed Google Scholar
Chong-Dae Park
View author publications
You can also search for this author in PubMed Google Scholar
Dong-Min Woo
View author publications
You can also search for this author in PubMed Google Scholar
Taikyeong Jeong
View author publications
You can also search for this author in PubMed Google Scholar
Il-Hwan Cho
View author publications
You can also search for this author in PubMed Google Scholar
Yunsik Lee
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

The Key Laboratory of Manufacture and Test, Chongqing University of Technology, 400054, Chongqing, P. R. China
Ran Chen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kim, TH. et al. (2011). A Web Information Retrieval System. In: Chen, R. (eds) Intelligent Computing and Information Science. ICICIS 2011. Communications in Computer and Information Science, vol 135. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-18134-4_81

Download citation

DOI: https://doi.org/10.1007/978-3-642-18134-4_81
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-18133-7
Online ISBN: 978-3-642-18134-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

A Web Information Retrieval System

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Product Information Retrieval on the Web: An Empirical Study

Web Information Retrieval and Search

Semistructured Data Search

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

A Web Information Retrieval System

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Product Information Retrieval on the Web: An Empirical Study

Web Information Retrieval and Search

Semistructured Data Search

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation