Abstract
For a given query, searching for entities that conform to the description facts in the given set, in view of this goal, this paper proposes a matching method based on classification and semantic extension. The algorithm firstly to classify the query string into three categories, and extract the key word of different categories of query word. Then the keyword is extended to get the matching word set based on the word2vec word vector model. At last we calculate the score of every entity by the weighted matching method and get results according to the score ranking. After the experiment, the method get the correct rate of 63.2%, which has good applicability, and to a certain extent, it reduces the retrieval failure rate due to the query of the spoken language and diversification.
This work is supported by the National Natural Science Foundation of China under Grants No. 61271304, 61671070, Bei**g Advanced Innovation Center for Imaging Technology BAICIT-2016003, National Social Science Foundation of China under Grants No. 14@ZH036, 15ZDB017, National Language Committee of China under Grants No. ZDA125-26.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
**, J., Zhiming, C.: Intelligent search engine user interest model analysis and research. Microelectron. Comput. 11, 24–26 (2004)
**uli, H., Qiaoming, Z., Li, P.F.: The combination of semantic analysis and word frequency statistics of Chinese text similarity measurement method. Appl. Res. Comput. 03, 833–836 (2012)
Wolf, L., et al.: Joint word2vec networks for bilingual semantic representations. Int. J. Comput. Linguist. Appl. 5(1), 27–44 (2014)
Cheng, T., Chang, K.C.C.: Entity search engine: towards agile best-effort information integration over the web. In: CIDR, vol. 2007 (2007)
Balog, K., Bron, M., De Rijke, M.: Query modeling for entity search based on terms, categories, and examples. ACM Trans. Inf. Syst. (TOIS) 29(4), 22 (2011)
Neumayer, R., Balog, K., Nørvåg, K.: On the modeling of entities for ad-hoc entity search in the web of data. In: Baeza-Yates, R., Vries, A.P., Zaragoza, H., Cambazoglu, B.B., Murdock, V., Lempel, R., Silvestri, F. (eds.) ECIR 2012. LNCS, vol. 7224, pp. 133–145. Springer, Heidelberg (2012). doi:10.1007/978-3-642-28997-2_12
Bron, M., Balog, K., Rijke, M.: Example based entity search in the web of data. In: Serdyukov, P., Braslavski, P., Kuznetsov, S.O., Kamps, J., Rüger, S., Agichtein, E., Segalovich, I., Yilmaz, E. (eds.) ECIR 2013. LNCS, vol. 7814, pp. 392–403. Springer, Heidelberg (2013). doi:10.1007/978-3-642-36973-5_33
Balog, K., Bron, M., Rijke, M., Weerkamp, W.: Combining term-based and category-based representations for entity search. In: Geva, S., Kamps, J., Trotman, A. (eds.) INEX 2009. LNCS, vol. 6203, pp. 265–272. Springer, Heidelberg (2010). doi:10.1007/978-3-642-14556-8_27
Freitas, A., et al.: Treo: combining entity-search, spreading activation and semantic relatedness for querying linked data. In: Proceedings of 1st Workshop on Question Answering over Linked Data (QALD-1) at the 8th Extended Semantic Web Conference (ESWC 2011) (2011)
Rong, X.: word2vec parameter learning explained. ar**v preprint ar**v (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Wang, T., Lv, X., Ma, X., Sun, P., Dong, Z., Zhou, J. (2016). Study on the Method of Precise Entity Search Based on Baidu’s Query. In: Lin, CY., Xue, N., Zhao, D., Huang, X., Feng, Y. (eds) Natural Language Understanding and Intelligent Applications. ICCPOL NLPCC 2016 2016. Lecture Notes in Computer Science(), vol 10102. Springer, Cham. https://doi.org/10.1007/978-3-319-50496-4_74
Download citation
DOI: https://doi.org/10.1007/978-3-319-50496-4_74
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-50495-7
Online ISBN: 978-3-319-50496-4
eBook Packages: Computer ScienceComputer Science (R0)