`dnacount.go`

A quick script in Go to count the base pair frequencies in genomes w/ parallelization across named FASTA labels/regions. With dnacount.go, I computed that the human genome has a GC bias of 41%. See results below (bottom of page).

Only dependency is https://github.com/schollz/progressbar to show a progress bar for very long sequences.

Build

go build

Execute

./dnacount data/repeat_GCF_000863945.3_ViralProj15505_genomic.fna

returns

Loaded 'data/GCF_000863945.3_ViralProj15505_genomic.fna' into RAM
 100% |████████████████████████████████████████| [0s:0s]            
Total length 8006
G 19.12%
C 17.42%
T 30.60%
A 32.86%
GC bias of 36.54%

Entire Human Genome

Download reference human genome https://www.ncbi.nlm.nih.gov/datasets/genome/GCF_000001405.40/

wget -O human_genome.zip https://api.ncbi.nlm.nih.gov/datasets/v2/genome/accession/GCF_000001405.40/download?include_annotation_type=GENOME_FASTA && unzip human_genome.zip -d data/human_genome && rm -fr human_genome.zip

execute

time ./dnacount data/human_genome/ncbi_dataset/data/GCF_000001405.40/GCF_000001405.40_GRCh38.p14_genomic.fna

returns

Loaded 'data/human_genome/ncbi_dataset/data/GCF_000001405.40/GCF_000001405.40_GRCh38.p14_genomic.fna' into RAM
 100% |████████████████████████████████████████| [32s:0s]            
Total length 3339662079
G 20.57%
C 20.48%
T 29.52%
A 29.43%
GC bias of 41.05%
./dnacount   291.64s user 3.66s system 733% cpu 40.242 total

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
data		data
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
dnacount.go		dnacount.go
go.mod		go.mod
go.sum		go.sum

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

`dnacount.go`

About

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

dnacount.go

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Uh oh!

Contributors

Uh oh!

Languages

`dnacount.go`