---
license: cc-by-nc-nd-4.0
---
# Directory Structure
```
.
├── README.md
├── dpacman
│   ├── data
│   │   ├── README.md
│   │   ├── chip_atlas
│   │   │   ├── full_data_loading.py
│   │   │   └── smaller_data_loading.py
│   │   ├── remap
│   │   │   └── analyze.py
│   │   └── tfclust
│   │       ├── analyze.py
│   │       ├── api_download.py
│   │       ├── combine.py
│   │       ├── download.py
│   │       ├── figures
│   │       │   ├── seq_lengths_box.png
│   │       │   ├── seq_lengths_flanked_box.png
│   │       │   ├── seq_lengths_flanked_hist.png
│   │       │   ├── seq_lengths_flanked_xlog_box.png
│   │       │   ├── seq_lengths_flanked_xlog_hist.png
│   │       │   ├── seq_lengths_hist.png
│   │       │   ├── seq_lengths_xlog_box.png
│   │       │   └── seq_lengths_xlog_hist.png
│   │       └── hg38_success_download.log
│   └── data_files
│       ├── processed
│       │   └── tfclust
│       │       ├── hg19
│       │       │   ├── encRegTfbsClustered_hg19_chr1.csv
│       │       │   └── logs
│       │       │       ├── completed.txt
│       │       │       ├── completed_worker_0.txt
│       │       │       └── worker_0.log
│       │       └── hg38
│       │           ├── encRegTfbsClustered_hg38_chr1.csv
│       │           └── logs
│       │               ├── completed.txt
│       │               ├── completed_worker_0.txt
│       │               └── worker_0.log
│       └── raw
│           ├── chip_atlas
│           │   └── experimentList.tab
│           ├── genomes
│           │   ├── hg19
│           │   │   └── hg19_chr1.json
│           │   └── hg38
│           │       └── hg38_chr1.json
│           ├── remap
│           │   ├── reMap2022.bb
│           │   ├── reMap2022.bed
│           │   ├── remap2022_all_macs2_hg38_v1_0.bed.gz
│           │   └── remap2022_crm_macs2_hg38_v1_0.bed
│           └── tfclust
│               ├── encRegTfbsClusteredWithCells.hg19.bed
│               ├── encRegTfbsClusteredWithCells.hg38.bed
│               └── encRegTfbsClustered_data
│                   ├── hg19
│                   │   └── hg19_encRegTfbsClustered_chr1.json
│                   └── hg38
│                       └── hg38_encRegTfbsClustered_chr1.json
├── environment.yaml
├── setup.py
└── tree_output.txt
```
20 directories, 3089 files
In the `data_files` subfolders, only one representative file (for `chr1`) is shown per file type. In reality, every file whose name contains the substring `_chr` exists once for each chromosome in that genome: 711 chromosomes for hg38 and 298 for hg19. To regenerate the full directory listing, run the following from `DPACMAN`:
```
tree -I '__pycache__|*.egg-info|*.git' > tree.txt
```
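
To check that the per-chromosome files are all present, one option is to count the `_chr` files in each folder and compare the totals against the chromosome counts given above. The following is a minimal sketch, not part of the repository: `count_chr_files` is a hypothetical helper, and the `dpacman/data_files` path in the usage note is assumed from the tree shown here.

```python
from collections import Counter
from pathlib import Path


def count_chr_files(root):
    """Count files whose name contains '_chr', grouped by parent folder.

    Hypothetical helper for illustration; returns a dict that maps each
    folder (relative to root, POSIX-style) to its number of '_chr' files.
    """
    root = Path(root)
    counts = Counter()
    # rglob walks root recursively; the pattern matches any name containing '_chr'.
    for path in root.rglob("*_chr*"):
        if path.is_file():
            counts[path.parent.relative_to(root).as_posix()] += 1
    return dict(counts)
```

Calling `count_chr_files("dpacman/data_files")` from the repository root would then give one count per folder, which can be compared with the expected number of chromosomes for hg19 or hg38.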