Skip to main content

previous disabled Page of 3
and
  1. Article

    Open Access

    Revisiting the complexity of and algorithms for the graph traversal edit distance and its variants

    The graph traversal edit distance (GTED), introduced by Ebrahimpour Boroojeny et al. (2018), is an elegant distance measure defined as the minimum edit distance between strings reconstructed from Eulerian trai...

    Yutong Qiu, Yihang Shen, Carl Kingsford in Algorithms for Molecular Biology (2024)

  2. No Access

    Chapter and Conference Paper

    Graph-Based Genome Inference from Hi-C Data

    Three-dimensional chromosome structure plays an important role in fundamental genomic functions. Hi-C, a high-throughput, sequencing-based technique, has drastically expanded our comprehension of 3D chromosome...

    Yihang Shen, Lingge Yu, Yutong Qiu in Research in Computational Molecular Biology (2024)

  3. No Access

    Chapter and Conference Paper

    A Scalable Optimization Algorithm for Solving the Beltway and Turnpike Problems with Uncertain Measurements

    The Beltway and Turnpike problems entail the reconstruction of circular and linear one-dimensional point sets from unordered pairwise distances. These problems arise in computational biology when the measurements...

    C. S. Elder, Minh Hoang, Mohsen Ferdosi in Research in Computational Molecular Biology (2024)

  4. No Access

    Chapter and Conference Paper

    DeepMinimizer: A Differentiable Framework for Optimizing Sequence-Specific Minimizer Schemes

    Minimizers are k-mer sampling schemes designed to generate sketches for large sequences that preserve sufficiently long matches between sequences. Despite their widespread application, learning an effective mi...

    Minh Hoang, Hongyu Zheng, Carl Kingsford in Research in Computational Molecular Biology (2022)

  5. Article

    Open Access

    VariantStore: an index for large-scale genomic variant search

    Efficiently scaling genomic variant search indexes to thousands of samples is computationally challenging due to the presence of multiple coordinate systems to avoid reference biases. We present VariantStore, ...

    Prashant Pandey, Yinjie Gao, Carl Kingsford in Genome Biology (2021)

  6. Article

    Open Access

    Exact transcript quantification over splice graphs

    The probability of sequencing a set of RNA-seq reads can be directly modeled using the abundances of splice junctions in splice graphs instead of the abundances of a list of transcripts. We call this model gra...

    Cong Ma, Hongyu Zheng, Carl Kingsford in Algorithms for Molecular Biology (2021)

  7. Article

    Open Access

    Harvestman: a framework for hierarchical feature learning and selection from whole genome sequencing data

    Supervised learning from high-throughput sequencing data presents many challenges. For one, the curse of dimensionality often leads to overfitting as well as issues with scalability. This can bring about inacc...

    Trevor S. Frisby, Shawn J. Baker, Guillaume Marçais, Quang Minh Hoang in BMC Bioinformatics (2021)

  8. Article

    Open Access

    Alignment and map** methodology influence transcript abundance estimation

    The accuracy of transcript quantification using RNA-seq data depends on many factors, such as the choice of alignment or map** method and the quantification model being adopted. While the choice of quantific...

    Avi Srivastava, Laraib Malik, Hirak Sarkar, Mohsen Zakeri in Genome Biology (2020)

  9. Article

    Open Access

    Context-aware seeds for read map**

    Most modern seed-and-extend NGS read mappers employ a seeding scheme that requires extracting t non-overlap** seeds in each read in order to find all valid map**s under an edit distance threshold of t. As t g...

    Hongyi **n, Mingfu Shao, Carl Kingsford in Algorithms for Molecular Biology (2020)

  10. Article

    Open Access

    Detecting transcriptomic structural variants in heterogeneous contexts via the Multiple Compatible Arrangements Problem

    Transcriptomic structural variants (TSVs)—large-scale transcriptome sequence change due to structural variation - are common in cancer. TSV detection from high-throughput sequencing data is a computationally c...

    Yutong Qiu, Cong Ma, Han **e, Carl Kingsford in Algorithms for Molecular Biology (2020)

  11. No Access

    Chapter and Conference Paper

    Lower Density Selection Schemes via Small Universal Hitting Sets with Short Remaining Path Length

    Universal hitting sets are sets of words that are unavoidable: every long enough sequence is hit by the set (i.e., it contains a word from the set). There is a tight relationship between universal hitting sets...

    Hongyu Zheng, Carl Kingsford in Research in Computational Molecular Biology (2020)

  12. Article

    Open Access

    Quantifying the benefit offered by transcript assembly with Scallop-LR on single-molecule long reads

    Single-molecule long-read sequencing has been used to improve mRNA isoform identification. However, not all single-molecule long reads represent full transcripts due to incomplete cDNA synthesis and sequencing...

    Laura H. Tung, Mingfu Shao, Carl Kingsford in Genome Biology (2019)

  13. Article

    Open Access

    Semi-nonparametric modeling of topological domain formation from epigenetic data

    Hi-C experiments capturing the 3D genome architecture have led to the discovery of topologically-associated domains (TADs) that form an important part of the 3D genome organization and appear to play a role in...

    Emre Sefer, Carl Kingsford in Algorithms for Molecular Biology (2019)

  14. Article

    Open Access

    SQUID: transcriptomic structural variation detection from RNA-seq

    Transcripts are frequently modified by structural variations, which lead to fused transcripts of either multiple genes, known as a fusion gene, or a gene and a previously non-transcribed sequence. Detecting th...

    Cong Ma, Mingfu Shao, Carl Kingsford in Genome Biology (2018)

  15. Article

    Open Access

    Kourami: graph-guided assembly for novel human leukocyte antigen allele discovery

    Accurate ty** of human leukocyte antigen (HLA) is important because HLA genes play important roles in immune responses and disease genesis. Previously available computational methods are database-matching ap...

    Heewook Lee, Carl Kingsford in Genome Biology (2018)

  16. No Access

    Protocol

    Accurate Assembly and Ty** of HLA using a Graph-Guided Assembler Kourami

    Accurate ty** of human leukocyte antigen (HLA) is essential for successful organ transplantation and HLA genes are heavily associated with various diseases. Widely used ty** assays often involve a set of s...

    Heewook Lee, Carl Kingsford in HLA Ty** (2018)

  17. No Access

    Article

    Accurate assembly of transcripts through phase-preserving graph decomposition

    Scallop improves identification of multi-exon and transcripts expressed at low levels by retaining phasing information during assembly.

    Mingfu Shao, Carl Kingsford in Nature Biotechnology (2017)

  18. No Access

    Article

    Salmon provides fast and bias-aware quantification of transcript expression

    Salmon is a computational tool that uses sample-specific models and a dual-phase inference procedure to correct biases in RNA-seq data and rapidly quantify transcript abundances.

    Rob Patro, Geet Duggal, Michael I Love, Rafael A Irizarry, Carl Kingsford in Nature Methods (2017)

  19. No Access

    Chapter and Conference Paper

    Improved Search of Large Transcriptomic Sequencing Databases Using Split Sequence Bloom Trees

    Enormous databases of short-read RNA-seq sequencing experiments such as the NIH Sequencing Read Archive (SRA) are now available. These databases could answer many questions about the condition-specific express...

    Brad Solomon, Carl Kingsford in Research in Computational Molecular Biology (2017)

  20. Article

    Open Access

    A pathway-centric view of spatial proximity in the 3D nucleome across cell lines

    In various contexts, spatially proximal genes have been shown to be functionally related. However, the extent to which spatial proximity of genes in a pathway contributes to the pathway’s context-specific acti...

    Hiren Karathia, Carl Kingsford, Michelle Girvan, Sridhar Hannenhalli in Scientific Reports (2016)

previous disabled Page of 3