Skip to content

Commit 7ba2c46

Browse files
authored
Merge pull request #3 from amorehead/develop
Develop to Master Checkpoint
2 parents 8820aed + ba789d7 commit 7ba2c46

17 files changed

+1869
-29
lines changed

.gitignore

+7-1
Original file line numberDiff line numberDiff line change
@@ -131,4 +131,10 @@ project/datasets/DB5/final/final_raw_db5.tar.gz*
131131
project/datasets/EVCoupling/raw/**
132132
project/datasets/EVCoupling/interim/**
133133
project/datasets/EVCoupling/final/raw/**
134-
project/datasets/EVCoupling/final/processed/**
134+
project/datasets/EVCoupling/final/processed/**
135+
136+
# CASP-CAPRI
137+
project/datasets/CASP-CAPRI/raw/**
138+
project/datasets/CASP-CAPRI/interim/**
139+
project/datasets/CASP-CAPRI/final/raw/**
140+
project/datasets/CASP-CAPRI/final/processed/**

README.md

-1
Original file line numberDiff line numberDiff line change
@@ -203,7 +203,6 @@ tar -xzf project/datasets/DB5.tar.gz --directory project/datasets/
203203

204204
# Remove (now) redundant DB5 archive and other miscellaneous files:
205205
rm project/datasets/DB5.tar.gz project/datasets/DB5/.README.swp
206-
rm project/datasets/DB5.tar.gz project/datasets/DB5/.README.swp
207206
rm -rf project/datasets/DB5/interim project/datasets/DB5/processed
208207

209208
# Create relevant interim and final data directories:

project/datasets/DB5/README

+11-12
Original file line numberDiff line numberDiff line change
@@ -1,25 +1,24 @@
11
Cleaned up version of Docking Benchmark 5 (https://zlab.umassmed.edu/benchmark/).
22

3-
Originally released with "End-to-End Learning on 3D Protein Structure for Interface Prediction."
3+
Released with "End-to-End Learning on 3D Protein Structure for Interface Prediction."
44
by Raphael J.L. Townshend, Rishi Bedi, Patricia Suriana, Ron O. Dror
55
https://arxiv.org/abs/1807.01297
66

77
Specifically, bound chains and residue indexes were aligned across unbound and bound complexes.
88

99
A total of 230 binary protein complexes are included.
1010

11-
Processing code to regenerate and use the original provided tfrecords is located at
11+
Processing code to regenerate and use the provided tfrecords is located at
1212
https://github.com/drorlab/DIPS
1313

14-
final/
15-
raw/ - List of all postprocessed (i.e. feature-enriched) pairs.
14+
MANIFEST
1615

16+
raw/ - All pre-aligned and cleaned DB5 structures, organized into directories
17+
with individual files for ligand-unbound, ligand-bound, receptor-unbound,
18+
receptor-bound.
1719
interim/
18-
complexes/ - List of all possible pairs in parsed.
19-
external_feats/ - List of all residue-level features generated using external software packages.
20-
pairs/ - Dill files of individual pairs listed in complexes.
21-
parsed/ - All DB5 structures processed to pickled dataframes.
22-
23-
raw/ - All pre-aligned and cleaned DB5 structures, organized into directories
24-
with individual files for ligand-unbound, ligand-bound, receptor-unbound,
25-
receptor-bound.
20+
parsed/ - All DB5 structures processed to pickled dataframes.
21+
complexes/ - List of all possible pairs in parsed.
22+
pairs/ - Dill files of indivudal pairs listed in complexes.
23+
processed/
24+
tfrecords/ - pairs converted to tfrecords.

0 commit comments

Comments
 (0)