Algorithms for causal learning and comparative analysis for genomic data
Machnik NN. 2024. Algorithms for causal learning and comparative analysis for genomic data. Institute of Science and Technology Austria.
Download
              
            
            
            
            Thesis
            
            
            |  PhD
            
            
            | Published
            
            
              |              English
              
            
          
        Author
        Supervisor
        Corresponding author has ISTA affiliation
Department
    Series Title
    
    ISTA Thesis
Abstract
    This thesis consists of two pieces of work in the broader feld of computational biology,
both of which are methods for the analysis of large scale biological data, implemented in
efcient software.
Chapter 2 introduces a statistical software for causal discovery and inference from observed
genetic marker and phenotypic trait data. We explore in simulation how well the method
can fne-map genetic efects, fnd the correct causal structure among tens of traits and
millions of genetic markers, and infer the causal efect size for the discovered causal
relations. We then apply the method to 8 million markers and 17 traits from the UK
Biobank and show that many relationships found with other methods are likely due to
the efects of hidden confounders.
Chapter 3 describes how this method can be applied to longitudinal data. I show how one
can incorporate the background knowledge present in the known order of measurements to
improve the accuracy of the causal discovery process, and explore the method’s ability to
identify age specifc genetic efects, and how the error rates of this recovery are infuenced
by missing data due to diferent censoring mechanisms.
Chapter 4 introduces a statistical software for the comparison of chromatin contact maps
based on the structural similarity index. We explore the robustness of the method to
noise and size diferences of the compared maps, show how it can measure evolutionary
conservation of topological features by providing a similarity ranking of syntenic regions,
and fnally how it can detect alterations in 3D genome structure due to genetic mutations
in samples of medical relevance.
    
  Publishing Year
    
  Date Published
    2024-12-11
  Publisher
    Institute of Science and Technology Austria
  Acknowledgement
    I would like to thank the Swiss National Science Foundation for funding parts of this work
through the Eccellenza Grant "Improving estimation and prediction of common complex
disease risk" with grant number PCEGP3_181181.
  Page
      138
    ISSN
    
  IST-REx-ID
    
  Cite this
Machnik NN. Algorithms for causal learning and comparative analysis for genomic data. 2024. doi:10.15479/at:ista:18642
    Machnik, N. N. (2024). Algorithms for causal learning and comparative analysis for genomic data. Institute of Science and Technology Austria. https://doi.org/10.15479/at:ista:18642
    Machnik, Nick N. “Algorithms for Causal Learning and Comparative Analysis for Genomic Data.” Institute of Science and Technology Austria, 2024. https://doi.org/10.15479/at:ista:18642.
    N. N. Machnik, “Algorithms for causal learning and comparative analysis for genomic data,” Institute of Science and Technology Austria, 2024.
    Machnik NN. 2024. Algorithms for causal learning and comparative analysis for genomic data. Institute of Science and Technology Austria.
    Machnik, Nick N. Algorithms for Causal Learning and Comparative Analysis for Genomic Data. Institute of Science and Technology Austria, 2024, doi:10.15479/at:ista:18642.
  
      All files available under the following license(s):
      
      
        
          
        
          
          
      
      
    
  
            Copyright Statement:
          
        
            This Item is protected by copyright and/or related rights. [...]
          
        
      Main File(s)
    
  File Name
    
        
          
          
            NickMachnikThesisFinal_pdfa_conv.pdf
          
        
       12.85 MB
    
  Access Level
     Open Access
 Open Access
    Date Uploaded
    
      2024-12-11
    
  Embargo End Date
    
      2025-06-12
    
  MD5 Checksum
    
      d45e4d170f9a70a1f69b44b99bd058e4
    
  
      Source File
    
  File Name
    
      
        thesis.zip
       14.19 MB
    
  Access Level
     Closed Access
 Closed Access
    Date Uploaded
    
      2024-12-11
    
  MD5 Checksum
    f88c9acc62002395ec4dcbdb5eea8b82
  
      Material in ISTA:
    
  
      Part of this Dissertation
    
  
      Part of this Dissertation
    
  

 Google Scholar
Google Scholar