An Efficient Two-Stage Gene Selection Method for Microarray Data

Du, Dajun; Li, Kang; Deng, **g

doi:10.1007/978-3-642-37105-9_47

Dajun Du^4,5,
Kang Li⁵ &
**g Deng⁵

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 355))

Included in the following conference series:

International Conference on Intelligent Computing for Sustainable Energy and Environment

3182 Accesses
3 Citations

Abstract

Gene selection is a key issue in the analysis of microarray data with small samples and variant correlation. The main objective of this paper is to select the most informative genes from thousands of genes with strong correlation. This is achieved by proposing an efficient two-stage gene selection (TSGS) algorithm. In this algorithm, the L ₂-norm penalty are firstly introduced to achieve the grou** effect for the highly correlated genes. To overcome the small samples problem, the augmented data technique is then used to produce an augmented data set. Finally, by using the recently proposed two-stage algorithm, the most informative genes can be selected effectively. Simulation results confirm its effectiveness of the proposed approach in comparison with the popular Elastic Net method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Chapter: EUR 29.95; Price includes VAT (Germany)

eBook: EUR 42.79; Price includes VAT (Germany)

Softcover Book: EUR 53.49; Price includes VAT (Germany)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Robust gene selection methods using weighting schemes for microarray data analysis

Article Open access 02 September 2017

Weighted-SAMGSR: combining significance analysis of microarray-gene set reduction algorithm with pathway topology-based weights to select relevant genes

Article Open access 29 September 2016

A robust and stable gene selection algorithm based on graph theory and machine learning

Article Open access 09 November 2021

References

Golub, T.R., et al.: Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 286, 531–537 (1999)
Article Google Scholar
Guyon, I., Elisseeff, A.: An introduction to variable and feature selection. Journal of Machine Learning Research 3, 1157–1182 (2003)
MATH Google Scholar
Liu, B., Wan, C., Wang, L.: An efficient semi-unsupervised gene selecttion method via spectra biclustering. IEEE Transactions on Nanobioscience 5(2), 110–114 (2006)
Article Google Scholar
Kohavi, R., John, G.H.: Wrappers for feature subset selection. Artificial Intelligence 97(1-2), 273–324 (1997)
Article MATH Google Scholar
Cai, R., Hao, Z., Yang, X., Wen, W.: An efficient gene selection algorithm based on mutual information. Neurocomputing 72, 991–999 (2009)
Article Google Scholar
Zhou, X., Mao, K.Z.: LS bound based gene selection for DNA micorarray data. Bioinformatics 21(8), 1559–1564 (2005)
Article Google Scholar
Freund, Y., Schapire, R.: A dicision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences 55, 119–139 (1997)
Article MathSciNet MATH Google Scholar
Zou, H., Hastie, T.: Regularization and variable selection via the elastic net. J.R. Statist. Soc.B 67(2), 301–320 (2005)
Article MathSciNet MATH Google Scholar
Li, K., Peng, J.X., Bai, E.W.: A two-stage algorithm for identification of nonlinear dynamic systems. Automatica 42(7), 1189–1197 (2006)
Article MathSciNet MATH Google Scholar
Marquardt, D.W.: Generalized inverses, ridge regression, biased linerar estimation, and nonlinear estimation. Technometrics 12(3), 591–612 (1970)
Article MATH Google Scholar
Nelles, O.: Nonlinear system identification. Springer (2001)
Google Scholar
Sha, N., Vannucci, M., Brown, P., Trower, M., Amphlett, G.: Gene selection in arthritis classification with large-scale microarray expression profiles. Comparative and Functional Genomics 4, 171–181 (2003)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Key Laboratory of Power Station Automation Technology, Department of Automation, Shanghai University, 200072, Shanghai, China
Dajun Du
School of Electronics, Electrical Engineering and Computer Science, Queen’s University Belfast, United Kingdom
Dajun Du, Kang Li & **g Deng

Authors

Dajun Du
View author publications
You can also search for this author in PubMed Google Scholar
Kang Li
View author publications
You can also search for this author in PubMed Google Scholar
**g Deng
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Electronics, Electrical Engineering and Computer Science, Queen’s University Belfast, Ashby Building, Stranmillis Road, BT9 5AH, Belfast, UK
Kang Li
Department of Automation, Shanghai Jiao Tong University, 800 Dongchuan Road, 200240, Shanghai, China
Shaoyuan Li & Dewei Li &
School of Mechatronics Engineering and Automation, Shanghai University, 149 Yanchang Road, 200072, Shanghai, China
Qun Niu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Du, D., Li, K., Deng, J. (2013). An Efficient Two-Stage Gene Selection Method for Microarray Data. In: Li, K., Li, S., Li, D., Niu, Q. (eds) Intelligent Computing for Sustainable Energy and Environment. ICSEE 2012. Communications in Computer and Information Science, vol 355. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37105-9_47

Download citation

DOI: https://doi.org/10.1007/978-3-642-37105-9_47
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37104-2
Online ISBN: 978-3-642-37105-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

An Efficient Two-Stage Gene Selection Method for Microarray Data

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Robust gene selection methods using weighting schemes for microarray data analysis

Weighted-SAMGSR: combining significance analysis of microarray-gene set reduction algorithm with pathway topology-based weights to select relevant genes

A robust and stable gene selection algorithm based on graph theory and machine learning

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

An Efficient Two-Stage Gene Selection Method for Microarray Data

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Robust gene selection methods using weighting schemes for microarray data analysis

Weighted-SAMGSR: combining significance analysis of microarray-gene set reduction algorithm with pathway topology-based weights to select relevant genes

A robust and stable gene selection algorithm based on graph theory and machine learning

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation