Skip to content

Fudenberg-Research-Group/OccupancyInputCTCF

Repository files navigation

OccupancyInputCTCF

Description

This GitHub repository provides tools for training machine learning models to predict 3D chromatin architecture from single-molecule footprinting data (e.g., methylation patterns). It integrates sequence features and occupancy profiles to infer genome folding and validates predictions by comparing Hi-C data with simulated chromatin loop extrusion, incorporating locus-specific occupancy rates and dynamic CTCF barriers.

Workflow Figure

Structure of the repository

The structure of this repository follows as below:

  • output : files after processing and analyzing the input data.
  • analysis: notebooks and code for analyzing simulations and experimental data.
  • utils: necessary functions and tools for performing workflow
  • models: machine learning models for predicting CTCF occupancy rate.

Requirements

Installation

First,

git clone https://github.com/Fudenberg-Research-Group/OccupancyInputCTCF.git

Workflow

Running simulations

  1. One-Dimensional Lattice Simulation: with running workflow.py

Processing simulation data

After running the simulations, the simulated trajectories can be processed to generate in silico ChIP-seq profiles, 1d contact maps, and 3d contact maps (optional). Scripts for data processing available in processing. Instructions are provided with the relevant python code.

Analysis

Once the data is processed, observable features can be quantified

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages