-
Article
Open AccessDeepGRP: engineering a software tool for predicting genomic repetitive elements using Recurrent Neural Networks with attention
Repetitive elements contribute a large part of eukaryotic genomes. For example, about 40 to 50% of human, mouse and rat genomes are repetitive. So identifying and classifying repeats is an important step in ge...
-
Article
Open AccessFast online and index-based algorithms for approximate search of RNA sequence-structure patterns
It is well known that the search for homologous RNAs is more effective if both sequence and structure information is incorporated into the search. However, current tools for searching with RNA sequence-structu...
-
Article
Open AccessReadjoiner: a fast and memory efficient string graph-based sequence assembler
Ongoing improvements in throughput of the next-generation sequencing technologies challenge the current generation of de novo sequence assemblers. Most recent sequence assemblers are based on the construction ...
-
Article
Open AccessStructator: fast index-based search for RNA sequence-structure patterns
The secondary structure of RNA molecules is intimately related to their function and often more conserved than the sequence. Hence, the important task of searching databases for RNAs requires to match sequence...
-
Article
Open AccessCoCoNUT: an efficient system for the comparison and analysis of genomes
Comparative genomics is the analysis and comparison of genomes from different species. This area of research is driven by the large number of sequenced genomes and heavily relies on efficient algorithms and so...
-
Article
Open AccessEfficient computation of absent words in genomic sequences
Analysis of sequence composition is a routine task in genome research. Organisms are characterized by their base composition, dinucleotide relative abundance, codon usage, and so on. Unique subsequences are ma...
-
Article
Open AccessLTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons
Transposable elements are abundant in eukaryotic genomes and it is believed that they have a significant impact on the evolution of gene and chromosome structure. While there are several completed eukaryotic g...
-
Article
Open AccessOptimising oligonucleotide array design for ChIP-on-chip
-
Article
Open AccessFast index based algorithms and software for matching position specific scoring matrices
In biological sequence analysis, position specific scoring matrices (PSSMs) are widely used to represent sequence motifs in nucleotide as well as amino acid sequences. Searching with PSSMs in complete genomes ...