↓ Skip to main content

Identification of Low-Confidence Regions in the Pig Reference Genome (Sscrofa10.2)

Overview of attention for article published in Frontiers in Genetics, November 2015
Altmetric Badge

About this Attention Score

  • In the top 5% of all research outputs scored by Altmetric
  • High Attention Score compared to outputs of the same age (95th percentile)
  • High Attention Score compared to outputs of the same age and source (96th percentile)

Mentioned by

blogs
1 blog
twitter
61 X users
facebook
1 Facebook page

Citations

dimensions_citation
29 Dimensions

Readers on

mendeley
41 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Identification of Low-Confidence Regions in the Pig Reference Genome (Sscrofa10.2)
Published in
Frontiers in Genetics, November 2015
DOI 10.3389/fgene.2015.00338
Pubmed ID
Authors

Amanda Warr, Christelle Robert, David Hume, Alan L. Archibald, Nader Deeb, Mick Watson

Abstract

Many applications of high throughput sequencing rely on the availability of an accurate reference genome. Variant calling often produces large data sets that cannot be realistically validated and which may contain large numbers of false-positives. Errors in the reference assembly increase the number of false-positives. While resources are available to aid in the filtering of variants from human data, for other species these do not yet exist and strict filtering techniques must be employed which are more likely to exclude true-positives. This work assesses the accuracy of the pig reference genome (Sscrofa10.2) using whole genome sequencing reads from the Duroc sow whose genome the assembly was based on. Indicators of structural variation including high regional coverage, unexpected insert sizes, improper pairing and homozygous variants were used to identify low quality (LQ) regions of the assembly. Low coverage (LC) regions were also identified and analyzed separately. The LQ regions covered 13.85% of the genome, the LC regions covered 26.6% of the genome and combined (LQLC) they covered 33.07% of the genome. Over half of dbSNP variants were located in the LQLC regions. Of copy number variable regions identified in a previous study, 86.3% were located in the LQLC regions. The regions were also enriched for gene predictions from RNA-seq data with 42.98% falling in the LQLC regions. Excluding variants in the LQ, LC, or LQLC from future analyses will help reduce the number of false-positive variant calls. Researchers using WGS data should be aware that the current pig reference genome does not give an accurate representation of the copy number of alleles in the original Duroc sow's genome.

Timeline

Login to access the full chart related to this output.

If you don’t have an account, click here to discover Explorer

X Demographics

X Demographics

The data shown below were collected from the profiles of 61 X users who shared this research output. Click here to find out more about how the information was compiled.
As of 1 July 2024, you may notice a temporary increase in the numbers of X profiles with Unknown location. Click here to learn more.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 41 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Finland 1 2%
Spain 1 2%
Netherlands 1 2%
Unknown 38 93%

Demographic breakdown

Readers by professional status Count As %
Researcher 14 34%
Student > Ph. D. Student 8 20%
Student > Master 6 15%
Professor 3 7%
Student > Doctoral Student 2 5%
Other 2 5%
Unknown 6 15%
Readers by discipline Count As %
Agricultural and Biological Sciences 26 63%
Biochemistry, Genetics and Molecular Biology 4 10%
Veterinary Science and Veterinary Medicine 2 5%
Unknown 9 22%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 42. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 23 July 2022.
All research outputs
#1,059,526
of 26,625,282 outputs
Outputs from Frontiers in Genetics
#165
of 14,018 outputs
Outputs of similar age
#16,670
of 398,535 outputs
Outputs of similar age from Frontiers in Genetics
#2
of 55 outputs
Altmetric has tracked 26,625,282 research outputs across all sources so far. Compared to these this one has done particularly well and is in the 96th percentile: it's in the top 5% of all research outputs ever tracked by Altmetric.
So far Altmetric has tracked 14,018 research outputs from this source. They receive a mean Attention Score of 3.9. This one has done particularly well, scoring higher than 98% of its peers.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 398,535 tracked outputs that were published within six weeks on either side of this one in any source. This one has done particularly well, scoring higher than 95% of its contemporaries.
We're also able to compare this research output to 55 others from the same source and published within six weeks on either side of this one. This one has done particularly well, scoring higher than 96% of its contemporaries.