↓ Skip to main content

Structured feature selection using coordinate descent optimization

Overview of attention for article published in BMC Bioinformatics, April 2016
Altmetric Badge

Mentioned by

twitter
2 X users

Citations

dimensions_citation
26 Dimensions

Readers on

mendeley
42 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Structured feature selection using coordinate descent optimization
Published in
BMC Bioinformatics, April 2016
DOI 10.1186/s12859-016-0954-4
Pubmed ID
Authors

Mohamed F. Ghalwash, Xi Hang Cao, Ivan Stojkovic, Zoran Obradovic

Abstract

Existing feature selection methods typically do not consider prior knowledge in the form of structural relationships among features. In this study, the features are structured based on prior knowledge into groups. The problem addressed in this article is how to select one representative feature from each group such that the selected features are jointly discriminating the classes. The problem is formulated as a binary constrained optimization and the combinatorial optimization is relaxed as a convex-concave problem, which is then transformed into a sequence of convex optimization problems so that the problem can be solved by any standard optimization algorithm. Moreover, a block coordinate gradient descent optimization algorithm is proposed for high dimensional feature selection, which in our experiments was four times faster than using a standard optimization algorithm. In order to test the effectiveness of the proposed formulation, we used microarray analysis as a case study, where genes with similar expressions or similar molecular functions were grouped together. In particular, the proposed block coordinate gradient descent feature selection method is evaluated on five benchmark microarray gene expression datasets and evidence is provided that the proposed method gives more accurate results than the state-of-the-art gene selection methods. Out of 25 experiments, the proposed method achieved the highest average AUC in 13 experiments while the other methods achieved higher average AUC in no more than 6 experiments. A method is developed to select a feature from each group. When the features are grouped based on similarity in gene expression, we showed that the proposed algorithm is more accurate than state-of-the-art gene selection methods that are particularly developed to select highly discriminative and less redundant genes. In addition, the proposed method can exploit any grouping structure among features, while alternative methods are restricted to using similarity based grouping.

Timeline

Login to access the full chart related to this output.

If you don’t have an account, click here to discover Explorer

X Demographics

X Demographics

The data shown below were collected from the profiles of 2 X users who shared this research output. Click here to find out more about how the information was compiled.
As of 1 July 2024, you may notice a temporary increase in the numbers of X profiles with Unknown location. Click here to learn more.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 42 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Brazil 1 2%
Unknown 41 98%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 11 26%
Researcher 6 14%
Student > Master 4 10%
Other 3 7%
Student > Postgraduate 3 7%
Other 5 12%
Unknown 10 24%
Readers by discipline Count As %
Computer Science 14 33%
Medicine and Dentistry 5 12%
Biochemistry, Genetics and Molecular Biology 4 10%
Nursing and Health Professions 2 5%
Agricultural and Biological Sciences 2 5%
Other 4 10%
Unknown 11 26%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 1. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 09 April 2016.
All research outputs
#18,450,346
of 22,860,626 outputs
Outputs from BMC Bioinformatics
#6,326
of 7,294 outputs
Outputs of similar age
#220,527
of 300,802 outputs
Outputs of similar age from BMC Bioinformatics
#101
of 116 outputs
Altmetric has tracked 22,860,626 research outputs across all sources so far. This one is in the 11th percentile – i.e., 11% of other outputs scored the same or lower than it.
So far Altmetric has tracked 7,294 research outputs from this source. They typically receive a little more attention than average, with a mean Attention Score of 5.4. This one is in the 5th percentile – i.e., 5% of its peers scored the same or lower than it.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 300,802 tracked outputs that were published within six weeks on either side of this one in any source. This one is in the 15th percentile – i.e., 15% of its contemporaries scored the same or lower than it.
We're also able to compare this research output to 116 others from the same source and published within six weeks on either side of this one. This one is in the 5th percentile – i.e., 5% of its contemporaries scored the same or lower than it.