Search
Search Results
-
Assessment of the effects of different variable weights on wildfire susceptibility
In this study, wildfire susceptibility is mapped using various multi-criteria decision analysis techniques (AHP, SAW and VIKOR) and machine learning...
-
Predicting Chromatin Interactions from DNA Sequence Using DeepC
The genome 3D structure is central to understanding how disease-associated genetic variants in the noncoding genome regulate their target genes.... -
Structural characterizations and α-glucosidase inhibitory activities of four Lepidium meyenii polysaccharides with different molecular weights
Four polysaccharides (MCPa, MCPb, MCPc, MCPd) were obtained from Lepidium meyenii Walp. Their structures were characterized by chemical and...
-
Direct prediction of intrinsically disordered protein conformational properties from sequence
Intrinsically disordered regions (IDRs) are ubiquitous across all domains of life and play a range of functional roles. While folded domains are...
-
High-throughput deep learning variant effect prediction with Sequence UNET
Understanding coding mutations is important for many applications in biology and medicine but the vast mutation space makes comprehensive...
-
PepAnalyzer: predicting peptide properties using its sequence
Peptides are short linear molecules consisting of amino acids that play an essential role in most biological processes. They can treat diseases by...
-
Hist2Vec: Kernel-Based Embeddings for Biological Sequence Classification
Biological sequence classification is vital in various fields, such as genomics and bioinformatics. The advancement and reduced cost of genomic... -
Sequence-Based Nanobody-Antigen Binding Prediction
Nanobodies (Nb) are monomeric heavy-chain fragments derived from heavy-chain only antibodies naturally found in Camelids and Sharks. Their... -
Multiple Sequence Alignment
A multiple sequence alignment (MSA) is an alignment of three or more sequences. Given n strings S1, ... , Sn (... -
Scalable and unbiased sequence-informed embedding of single-cell ATAC-seq data with CellSpace
Standard scATAC sequencing (scATAC-seq) analysis pipelines represent cells as sparse numeric vectors relative to an atlas of peaks or genomic tiles...
-
Sensitive inference of alignment-safe intervals from biodiverse protein sequence clusters using EMERALD
Sequence alignments are the foundations of life science research, but most innovation so far focuses on optimal alignments, while information derived...
-
Anti-Hebbian plasticity drives sequence learning in striatum
Spatio-temporal activity patterns have been observed in a variety of brain areas in spontaneous activity, prior to or during action, or in response...
-
TFscope: systematic analysis of the sequence features involved in the binding preferences of transcription factors
Characterizing the binding preferences of transcription factors (TFs) in different cell types and conditions is key to understand how they...
-
The anticancer compound JTE-607 reveals hidden sequence specificity of the mRNA 3′ processing machinery
JTE-607 is an anticancer and anti-inflammatory compound and its active form, compound 2, directly binds to and inhibits CPSF73, the endonuclease for...
-
Fast and robust metagenomic sequence comparison through sparse chaining with skani
Sequence comparison tools for metagenome-assembled genomes (MAGs) struggle with high-volume or low-quality data. We present skani (
https://github.com/bluenote-1577/skani... -
Using pre-selected variants from large-scale whole-genome sequence data for single-step genomic predictions in pigs
BackgroundWhole-genome sequence (WGS) data harbor causative variants that may not be present in standard single nucleotide polymorphism (SNP) chip...
-
HostNet: improved sequence representation in deep neural networks for virus-host prediction
BackgroundThe escalation of viruses over the past decade has highlighted the need to determine their respective hosts, particularly for emerging ones...
-
Identification of the Sequence and the Length of Telomere DNA
Telomeres are essential nucleoprotein structures at the very ends of linear eukaryote chromosomes. They shelter the terminal genome territories... -
Clustering biological sequences with dynamic sequence similarity threshold
BackgroundBiological sequence clustering is a complicated data clustering problem owing to the high computation costs incurred for pairwise sequence...
-
iProL: identifying DNA promoters from sequence information based on Longformer pre-trained model
Promoters are essential elements of DNA sequence, usually located in the immediate region of the gene transcription start sites, and play a critical...