Search Page | SpringerLink

Parallel and private generalized suffix tree construction and query on genomic data

Background

Several technological advancements and digitization of healthcare data have provided the scientific community with a large quantity of...

Md Momin Al Aziz, Parimala Thulasiraman, Noman Mohammed in BMC Genomic Data

Article Open access 17 June 2022

SamQL: a structured query language and filtering tool for the SAM/BAM file format

Background

The Sequence Alignment/Map Format Specification (SAM) is one of the most widely adopted file formats in bioinformatics and many researchers...

Christopher T. Lee, Manolis Maragkakis in BMC Bioinformatics

Article Open access 02 October 2021

Fast, parallel, and cache-friendly suffix array construction

Purpose

String indexes such as the suffix array ( sa ) and the closely related longest common prefix ( lcp ) array are fundamental objects in...

Jamshed Khan, Tobias Rubel, ... Rob Patro in Algorithms for Molecular Biology

Article Open access 28 April 2024

gsufsort: constructing suffix arrays, LCP arrays and BWTs for string collections

Background

The construction of a suffix array for a collection of strings is a fundamental task in Bioinformatics and in many other applications that...

Felipe A. Louza, Guilherme P. Telles, ... Giovanna Rosone in Algorithms for Molecular Biology

Article Open access 22 September 2020

Prediction of plant secondary metabolic pathways using deep transfer learning

Background

Plant secondary metabolites are highly valued for their applications in pharmaceuticals, nutrition, flavors, and aesthetics. It is of great...

Han Bao, **hui Zhao, ... Guowang Xu in BMC Bioinformatics

Article Open access 19 September 2023

Finding maximal exact matches in graphs

Background

We study the problem of finding maximal exact matches (MEMs) between a query string Q and a labeled graph G . MEMs are an important class...

Nicola Rizzo, Manuel Cáceres, Veli Mäkinen in Algorithms for Molecular Biology

Article Open access 11 March 2024

MCProj: metacell projection for interpretable and quantitative use of transcriptional atlases

We describe MCProj—an algorithm for analyzing query scRNA-seq data by projections over reference single-cell atlases. We represent the reference as a...

Oren Ben-Kiki, Akhiad Bercovich, ... Amos Tanay in Genome Biology

Article Open access 05 October 2023

Indexing and searching petabase-scale nucleotide resources

Searching vast and rapidly growing nucleotide content in resources, such as runs in the Sequence Read Archive and assemblies for whole-genome shotgun...

Sergey A. Shiryev, Richa Agarwala in Nature Methods

Article 16 May 2024

Pfp-fm: an accelerated FM-index

FM-indexes are crucial data structures in DNA alignment, but searching with them usually takes at least one random access per character in the query...

Aaron Hong, Marco Oliva, ... Travis Gagie in Algorithms for Molecular Biology

Article Open access 10 April 2024

Genome-wide screening reveals the genetic basis of mammalian embryonic eye development

Background

Microphthalmia, anophthalmia, and coloboma (MAC) spectrum disease encompasses a group of eye malformations which play a role in childhood...

Justine M. Chee, Louise Lanoue, ... Ala Moshiri in BMC Biology

Article Open access 03 February 2023

The flax genome reveals orbitide diversity

Background

Ribosomally-synthesized cyclic peptides are widely found in plants and exhibit useful bioactivities for humans. The identification of...

Ziliang Song, Connor Burbridge, ... Martin J. T. Reaney in BMC Genomics

Article Open access 23 July 2022

Finding identical sequence repeats in multiple protein sequences: An algorithm

In recent years, several experimental evidences suggest that amino acid repeats are closely linked to many disease conditions, as they have a...

Vikas Kumar Maurya, Madhumathi Sanjeevi, ... Sekar Kanagaraj in Journal of Biosciences

Article 28 February 2024

Fast and robust metagenomic sequence comparison through sparse chaining with skani

Sequence comparison tools for metagenome-assembled genomes (MAGs) struggle with high-volume or low-quality data. We present skani ( https://github.com/bluenote-1577/skani...

Jim Shaw, Yun William Yu in Nature Methods

Article Open access 21 September 2023

Suffix sorting via matching statistics

We introduce a new algorithm for constructing the generalized suffix array of a collection of highly similar strings. As a first step, we construct a...

Zsuzsanna Lipták, Francesco Masillo, Simon J. Puglisi in Algorithms for Molecular Biology

Article Open access 12 March 2024

Fulgor: a fast and compact k-mer index for large-scale matching and color queries

The problem of sequence identification or matching—determining the subset of reference sequences from a given collection that are likely to contain a...

Jason Fan, Jamshed Khan, ... Rob Patro in Algorithms for Molecular Biology

Article Open access 22 January 2024

Efficient privacy-preserving variable-length substring match for genome sequence

The development of a privacy-preserving technology is important for accelerating genome data sharing. This study proposes an algorithm that securely...

Yoshiki Nakagawa, Satsuya Ohata, Kana Shimizu in Algorithms for Molecular Biology

Article Open access 26 April 2022

Intrinsic disorder in PRAME and its role in uveal melanoma

Introduction

The PReferentially expressed Antigen in MElanoma ( PRAME) protein has been shown to be an independent biomarker for increased risk of...

Michael Antonietti, David J. Taylor Gonzalez, ... Carol L. Karp in Cell Communication and Signaling

Article Open access 25 August 2023

An optimized FM-index library for nucleotide and amino acid search

Background

Pattern matching is a key step in a variety of biological sequence analysis pipelines. The FM-index is a compressed data structure for...

Tim Anderson, Travis J. Wheeler in Algorithms for Molecular Biology

Article Open access 31 December 2021

EZCancerTarget: an open-access drug repurposing and data-collection tool to enhance target validation and optimize international research efforts against highly progressive cancers

The expanding body of potential therapeutic targets requires easily accessible, structured, and transparent real-time interpretation of molecular...

David Dora, Timea Dora, ... Zoltan Lohinai in BioData Mining

Article Open access 01 October 2022

Integration of fingerprint-based similarity searching and kernel-based partial least squares analysis to predict inhibitory activity against CSK, HER2, JAK1, JAK2, and JAK3

Fingerprint-based similarity searching is an important strategy for virtual screening in drug discovery. In the present study, we carried out a...

Hemantkumar Deokar, Mrunalini Deokar, John K. Buolamwini in Molecular Diversity

Article 17 January 2023

Search

Filters

Search Results

Search

Navigation