. 2009 Aug 15;25(16):2078-9.

doi: 10.1093/bioinformatics/btp352. Epub 2009 Jun 8.

The Sequence Alignment/Map format and SAMtools

Heng Li¹, Bob Handsaker, Alec Wysoker, Tim Fennell, Jue Ruan, Nils Homer, Gabor Marth, Goncalo Abecasis, Richard Durbin, 1000 Genome Project Data Processing Subgroup

Affiliations

PMID:19505943
PMCID:PMC2723002
DOI:10.1093/bioinformatics/btp352

Free PMC article

The Sequence Alignment/Map format and SAMtools

Heng Liet al. Bioinformatics. 2009.

Free PMC article

. 2009 Aug 15;25(16):2078-9.

doi: 10.1093/bioinformatics/btp352. Epub 2009 Jun 8.

Authors

Heng Li¹, Bob Handsaker, Alec Wysoker, Tim Fennell, Jue Ruan, Nils Homer, Gabor Marth, Goncalo Abecasis, Richard Durbin, 1000 Genome Project Data Processing Subgroup

Affiliation

¹Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Cambridge, CB10 1SA, UK, Broad Institute of MIT and Harvard, Cambridge, MA 02141, USA.

PMID:19505943
PMCID:PMC2723002
DOI:10.1093/bioinformatics/btp352

Abstract

Summary:The Sequence Alignment/Map (SAM) format is a generic alignment format for storing read alignments against reference sequences, supporting short and long reads (up to 128 Mbp) produced by different sequencing platforms. It is flexible in style, compact in size, efficient in random access and is the format in which alignments from the 1000 Genomes Project are released. SAMtools implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments.

Availability:http://samtools.sourceforge.net.

Figures

**Fig. 1.**
Example of extended CIGAR and the pileup output. (a) Alignments of one pair of reads and three single-end reads. (b) The corresponding SAM file. The ‘ @SQ’ line in the header section gives the order of reference sequences. Notably, r001is the name of a read pair. According to FLAG163 (=1 + 2 + 32 + 128), the read mapped to position 7 is the second read in the pair (128) and regarded as properly paired (1 + 2); its mate is mapped to 37 on the reverse strand (32). Read r002has three soft-clipped (unaligned) bases. The coordinate shown in SAM is the position of the first aligned base. The CIGAR string for this alignment contains a P(padding) operation which correctly aligns the inserted sequences. Padding operations can be absent when an aligner does not support multiple sequence alignment. The last six bases of read r003map to position 9, and the first five to position 29 on the reverse strand. The hard clipping operation Hindicates that the clipped sequence is not present in the sequence field. The NMtag gives the number of mismatches. Read r004is aligned across an intron, indicated by the Noperation. (c) Simplified pileup output by SAMtools. Each line consists of reference name, sorted coordinate, reference base, the number of reads covering the position and read bases. In the fifth field, a dot or a comma denotes a base identical to the reference; a dot or a capital letter denotes a base from a read mapped on the forward strand, while a comma or a lowercase letter on the reverse strand.

See this image and copyright information in PMC

Cited by

Rapid and sensitive single-cell RNA sequencing with SHERRY2.
Di L, Liu B, Lyu Y, Zhao S, Pang Y, Zhang C, Wang J, Qi H, Shen J, Huang Y. Di L, et al. BMC Biol. 2022 Sep 30;20(1):213. doi: 10.1186/s12915-022-01416-x. BMC Biol. 2022. PMID:36175891 Free PMC article.
DNA methylation landscapes from pig's limbic structures underline regulatory mechanisms relevant for brain plasticity.
Perdomo-Sabogal A, Trakooljul N, Hadlich F, Murani E, Wimmers K, Ponsuksili S. Perdomo-Sabogal A, et al. Sci Rep. 2022 Sep 29;12(1):16293. doi: 10.1038/s41598-022-20682-x. Sci Rep. 2022. PMID:36175587 Free PMC article.
Spontaneous activity in whisker-innervating region of neonatal mouse trigeminal ganglion.
Banerjee P, Kubo F, Nakaoka H, Ajima R, Sato T, Hirata T, Iwasato T. Banerjee P, et al. Sci Rep. 2022 Sep 29;12(1):16311. doi: 10.1038/s41598-022-20068-z. Sci Rep. 2022. PMID:36175429 Free PMC article.
Regain flood adaptation in rice through a 14-3-3 protein OsGF14h.
Sun J, Zhang G, Cui Z, Kong X, Yu X, Gui R, Han Y, Li Z, Lang H, Hua Y, Zhang X, Xu Q, Tang L, Xu Z, Ma D, Chen W. Sun J, et al. Nat Commun。2022 Sep 29;13(1):5664. doi: 10.1038/s41467-022-33320-x. Nat Commun。2022. PMID:36175427 Free PMC article.
MYPT1-PP1β phosphatase negatively regulates both chromatin landscape and co-activator recruitment for beige adipogenesis.
Takahashi H, Yang G, Yoneshiro T, Abe Y, Ito R, Yang C, Nakazono J, Okamoto-Katsuyama M, Uchida A, Arai M, Jin H, Choi H, Tumenjargal M, Xie S, Zhang J, Sagae H, Zhao Y, Yamaguchi R, Nomura Y, Shimizu Y, Yamada K, Yasuda S, Kimura H, Tanaka T, Wada Y, Kodama T, Aburatani H, Zhu MS, Inagaki T, Osborne TF, Kawamura T, Ishihama Y, Matsumura Y, Sakai J. Takahashi H, et al. Nat Commun。2022 Sep 29;13(1):5715. doi: 10.1038/s41467-022-33363-0. Nat Commun。2022. PMID:36175407 Free PMC article.

See all "Cited by" articles

References

1. Kent WJ, et al. The human genome browser at UCSC. Genome Res. 2002;12:996–1006. -PMC-PubMed
1. Langmead B, et al. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009;10:R25. -PMC-PubMed
1. Li H, et al. Mapping short DNA sequencing reads and calling variants using mapping quality scores. Genome Res. 2008;18:1851–1858. -PMC-PubMed
1. Mardis ER. Next-generation DNA sequencing methods. Annu. Rev. Genomics Hum. Genet. 2008;9:387–402. -PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grant support

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations
Medical
- ClinicalTrials.gov
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

The Sequence Alignment/Map format and SAMtools

Affiliation

The Sequence Alignment/Map format and SAMtools

Authors

Affiliation

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Grant support

LinkOut - more resources

Full Text Sources

Other Literature Sources

Medical

Miscellaneous

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Related information

Grant support

LinkOut - more resources

Full Text Sources

Other Literature Sources

Medical

Miscellaneous