MABA16S

A pipeline for analyzing Oxford Nanopore 16S rRNA sequencing data from clinical samples.

Free software: MIT license

Overview

MABA16S processes 16S rRNA sequencing data to classify reads at the genus and species levels, providing detailed taxonomic identification for clinical microbiology.

Tools and Settings

Filtlong:
- Minimum read length: 1200 bp
Kraken2:
- Genera with a minimum of 50 reads are processed further
extract_kraken_reads.py from kraken-tools
Minimap2: alignment to SILVA reference sequences for specific genera
Samtools consensus
BLASTn

How does it work?

reads are classified on genus level using kraken2 and SILVA database
reads for each genus are extracted
each genus readset is mapped to the first species in the SILVA database of this genus
consensus sequence is extracted and BLASTed to the SILVA database to obtain a species ID
results are compiled and written to a spreadsheet

Quickstart

As a quickstart to use this pipeline you need Python 3.6 or higher, conda environment manager and snakemake.

Usage

git clone https://github.com/MUMC-MEDMIC/MABA16S 
cd MABA16S/maba16s
python cli.py snakemake -i folders_containing_nanopore16s_reads -o my_output_directory --cores 1 

# input are directories which hold your nanopore reads. Naming of the output will be done based on the names of these directories

Output File Structure

The output directory contains the following structure:

my_output_directory/ 
├── kraken2/  
│   ├── {sample}/  
│   │   ├── krakenreport_filtered.txt  # Filtered Kraken2 report  
│   │   ├── output.txt                # Full Kraken2 classification output  
│   │   ├── reads/                    # Genus-specific reads (FASTQ files)  
├── kraken2consensus/  
│   ├── {sample}/  
│   │   ├── reference_fastas/         # Reference FASTA files used for alignment  
│   │   ├── aligned_reads/            # BAM files for aligned reads  
│   │   ├── consensus_fastas/         # Consensus FASTA files  
├── BLAST/  
│   ├── {sample}/  
│   │   ├── *_BLASTn.txt              # BLASTn results for consensus sequences  
├── QC/  
│   ├── {sample}/  
│   │   ├── {sample}_qcPreprocessing.txt  # Preprocessing QC metrics  
│   │   ├── {sample}_qcPostAnalysis.txt   # Extended QC metrics  
├── reports/  
│   ├── {sample}.xlsx                 # Comprehensive report for each sample  
├── sankeys/  
    ├── {sample}_sankey.html          # Interactive Sankey diagrams

Credits

This package was created with Cookiecutter_ and the audreyr/cookiecutter-pypackage_ project template.

.. _Cookiecutter: https://github.com/audreyr/cookiecutter .. _audreyr/cookiecutter-pypackage: https://github.com/audreyr/cookiecutter-pypackage

Name		Name	Last commit message	Last commit date
Latest commit History 114 Commits
maba16s		maba16s
.editorconfig		.editorconfig
.gitignore		.gitignore
.travis.yml		.travis.yml
AUTHORS.rst		AUTHORS.rst
CONTRIBUTING.rst		CONTRIBUTING.rst
Dockerfile		Dockerfile
HISTORY.rst		HISTORY.rst
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
Makefile		Makefile
README.md		README.md
requirements_dev.txt		requirements_dev.txt
setup.cfg		setup.cfg
setup.py		setup.py
tox.ini		tox.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

MABA16S

Overview

Tools and Settings

How does it work?

Quickstart

Output File Structure

Credits

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

License

MUMC-MEDMIC/MABA16S

Folders and files

Latest commit

History

Repository files navigation

MABA16S

Overview

Tools and Settings

How does it work?

Quickstart

Output File Structure

Credits

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages