seqqc is a Nextflow pipeline for quality control of short- or long-read sequencing data. It quickly assesses the quality of sequencing data so that it can be posted to a public repository before analysis for biological insights. Faster open data, faster knowledge for everyone.
The sourmash Python package produces many outputs that describe the content and similarity of sequencing data. We developed a new R package, sourmashconsumr, that lets a wider range of users easily load, analyze, and visualize those outputs in R.
Feridun Mert Celebi, Elizabeth A. McDaniel, and Taylor Reiter
SC
+2
Published: Mar 07, 2023
A workflow orchestration framework can streamline repeatable tasks and make workflows broadly usable. From several options, we chose Nextflow due to the ease of deploying across platforms, vibrant nf-core community, and ability to manage and monitor workflows with Nextflow Tower.
Adair L. Borges, Rachel J. Dutton, Elizabeth A. McDaniel, Taylor Reiter, and Emily C.P. Weiss
RD
TR
Published: Mar 11, 2023
How do you approach getting a microbiome set up in a new lab? We’re sharing protocols for how we collected, stocked, and sequenced a set of cheese rind microbiomes and generated a high-quality metagenomics resource for future computational studies.
We want to swiftly generate genome assemblies and produce quality control statistics to gauge the need for more curation. We built a Nextflow pipeline that assembles Illumina, Nanopore, or PacBio sequencing reads for a single organism and runs QC checks on the resulting assembly.
We want to seamlessly process and summarize metagenomics data from Illumina or Nanopore technologies. We built a Nextflow workflow that handles common metagenomics tasks and produces useful outputs and intuitive visualizations.
Adair L. Borges, Rachel J. Dutton, Elizabeth A. McDaniel, Atanas Radkov, Taylor Reiter, and Emily C.P. Weiss
RD
+4
Published: Jul 19, 2023
We sampled cheese microbial communities to discover bacteriophages with unusual genome chemistries. We isolated 114 bacterial host strains and 17 phages, and identified one phage with a probable arabinose hypermodification of hydroxymethylcytosine.
Horizontal gene transfer (HGT) is the exchange of DNA between species. It can lead to the acquisition of new gene functions, so finding HGT events can reveal genome novelty. preHGT is a pipeline that uses multiple existing methods to quickly screen for transferred genes.
Rachel J. Dutton, Elizabeth A. McDaniel, and Manon Morin
RD
MM
DS
Published: Aug 15, 2023
Hoping to find proteins that alter physiology in useful ways, we screened venom data sets for toxins fused to domains with additional functionality. We identified candidates, but struggled to infer any novel functions, and none seem well-conserved across venomous species.
Prachee Avasthi, Feridun Mert Celebi, Elizabeth A. McDaniel, Kira E. Poskanzer, Michael E. Reitman, and Emily C.P. Weiss
SC
RD
+5
Published: Dec 20, 2023
Some human proteins are encoded by genes with repetitive sequences, which, if they expand, damage the nervous system and cause disorders like Huntington’s disease. We found animals with similar proteins that have more repeats than we’ve ever seen in healthy people.
Feridun Mert Celebi, Seemay Chou, Erin McGeever, Austin H. Patton, and Ryan York
SC
+4
Published: Sep 29, 2023
We want to find and use evolutionary innovations to solve present-day problems. We developed NovelTree, an efficient phylogenomic workflow that will empower us to decode the evolutionary traces of these innovations across the tree of life.
Prachee Avasthi, Brae M. Bigge, Feridun Mert Celebi, Keith Cheveralls, Jase Gehring, Erin McGeever, Gilad Mishne, Atanas Radkov, and 1 more
BB
KC
RD
+14
Published: Sep 29, 2023
The ProteinCartography pipeline identifies proteins related to a query protein using sequence- and structure-based searches, compares all protein structures, and creates a navigable map that can be used to look at protein relationships and make hypotheses about function.
Prachee Avasthi, Feridun Mert Celebi, and Elizabeth A. McDaniel
BB
+3
Published: Oct 06, 2023
Only some bacteria accumulate substantial amounts of polyphosphate (polyP). We thought that despite sequence divergence, polyP synthesis enzymes in these bacteria might have similar structures. We found this is sometimes true but doesn’t fully explain the phenomenon.
Feridun Mert Celebi, Megan L. Hochstrasser, Elizabeth A. McDaniel, and Jasmine Neal
MD
Published: Dec 20, 2023
Since releasing our pub on polyphosphate-forming proteins in bacteria, we’ve noticed the community has similar problems studying this process in diverse organisms. We’re actively seeking feedback with a focus on advancing basic discoveries and useful tools in this space!
Feridun Mert Celebi, Seemay Chou, Elizabeth A. McDaniel, Taylor Reiter, and Emily C.P. Weiss
SC
RD
+2
Published: Feb 24, 2024
We previously released a draft genome assembly for the lone star tick, A. americanum. We've now predicted genes from this assembly to use for downstream functional characterization and comparative genomics efforts.
Brae M. Bigge, Adair L. Borges, Seemay Chou, Elizabeth A. McDaniel, Kira E. Poskanzer, and Ryan York
BB
SC
RD
+5
Published: Aug 09, 2024
Inspired by wasps co-opting viral capsids to deliver genes to the caterpillars they parasitize, we looked for capsid-like proteins in other species. We found capsid homologs in ticks and other parasites, suggesting this phenomenon could be wider spread than previously known.
We assembled a comprehensive E. coli antimicrobial resistance phenotype-genotype resource. This dataset will aid large-scale genetic studies on anti-microbial resistance and support research in phylogenetics and other fields.
Adair L. Borges, Feridun Mert Celebi, Reilly O. Cooper, and Elizabeth A. McDaniel
RC
+1
Published: Aug 22, 2024
This workflow lets you find potential circular DNA in your organism of interest using short-read, whole-genome sequencing data and a reference genome. We applied it to parasitoid wasps and some other parasites and found putative circular DNA.
Adair L. Borges, Seemay Chou, Austin H. Patton, Taylor Reiter, Emily C.P. Weiss, and Ryan York
SC
RD
+5
Published: Jan 02, 2025
We investigated patterns of gene family evolution across ticks and other parasites. We used phylogenetic profiling and trait-association tests to identify gene families that may enable parasitic species to feed on hosts undetected for prolonged periods.
Adair L. Borges, Taylor Reiter, and Emily C.P. Weiss
TR
Published: Jan 02, 2025
We analyzed RNA-seq data from Amblyomma americanum to explore gene expression linked to skin manipulation during tick feeding. We built an interactive app to explore the differential expression results and find patterns related to tick sex, tissue, and time in blood meal.