Building a 3D genome atlas of human tissues down to base-pair resolutions

dc.contributor.advisorAiden, Aviva P
dc.contributor.advisorGrande-Allen, K. Jane
dc.creatorShamim, Muhammad Saad
dc.date.accessioned2023-06-13T15:51:36Z
dc.date.created2023-05
dc.date.issued2023-03-21
dc.date.submittedMay 2023
dc.date.updated2023-06-13T15:51:36Z
dc.descriptionEMBARGO NOTE: This item is embargoed until 2025-05-01
dc.description.abstractThe three-dimensional organization of the genome affects cell function and can be interrogated in an unbiased manner via methods such as Hi-C, a genome-wide proximity ligation assay. Hi-C assays have led to a deeper understanding of the mechanisms underlying chromosome conformation, such as chromatin loops and compartments. Improvements to the Hi-C protocol have made it possible to generate meaningful contact maps down to base-pair-resolutions, comparable to the types of resolutions used to analyze most epigenetics assays, such as ChIP-Seq. Generating these ultra-high-resolution maps, however, requires terabases of DNA sequencing. We developed an ecosystem of open-source software tools to support the analysis and visualization of ultra-deep Hi-C datasets. These tools include the Juicer 2.0 pipeline for processing billions of DNA sequencing reads into contact maps at base-pair resolutions; the straw library for powering rapid programmatic access to Hi-C data from both local and remote files; a novel ensemble deep learning approach to annotate chromatin loops; and a novel algorithm that combines dimensionality reduction with unsupervised learning to reliably identify genomic subcompartments. Together, these tools enable the comprehensive processing and analysis of Hi-C data to unprecedented resolutions. We then applied these tools to generate and fully annotate 10-base-pair-resolution maps of nuclear architecture in over 100 primary samples and cell lines, spanning over 40 unique human tissues. Taken together, these datasets report over 100 terabases of raw sequence data, revealing hundreds of thousands of DNA loops localized down to 10-base-pair resolution, and help elucidate the function of genome architecture across human anatomy.
dc.embargo.lift2025-05-01
dc.embargo.terms2025-05-01
dc.format.mimetypeapplication/pdf
dc.identifier.citationShamim, Muhammad Saad. "Building a 3D genome atlas of human tissues down to base-pair resolutions." (2023) Diss., Rice University. <a href="https://hdl.handle.net/1911/114904">https://hdl.handle.net/1911/114904</a>.
dc.identifier.urihttps://hdl.handle.net/1911/114904
dc.language.isoeng
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.
dc.subject3D Genomics
dc.subjectHi-C
dc.subjectChromatin Loops
dc.subjectSubcompartments
dc.subjectGenome Architecture
dc.titleBuilding a 3D genome atlas of human tissues down to base-pair resolutions
dc.typeThesis
dc.type.materialText
thesis.degree.departmentBioengineering
thesis.degree.disciplineEngineering
thesis.degree.grantorRice University
thesis.degree.levelDoctoral
thesis.degree.nameDoctor of Philosophy
Files
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.84 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.61 KB
Format:
Plain Text
Description: