verification
build_structure_peptides_comparison(structure, peptides)
Compares residue numbering and identity between a structure and peptides.
dictionary with:
Type | Description |
---|---|
DataFrame
|
|
DataFrame
|
|
DataFrame
|
|
Source code in hdxms_datasets/verification.py
113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 |
|
compare_structure_peptides(structure, peptides, returns='dict')
compare_structure_peptides(
structure: Structure,
peptides: Peptides,
returns: Literal["dict"] = "dict",
) -> CompareSummary
compare_structure_peptides(
structure: Structure,
peptides: Peptides,
returns: Literal["df"],
) -> pl.DataFrame
compare_structure_peptides(
structure: Structure,
peptides: Peptides,
returns: Literal["both"],
) -> tuple[CompareSummary, pl.DataFrame]
Compares structure and peptide data.
Source code in hdxms_datasets/verification.py
195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 |
|
datafiles_exist(dataset)
Check if the data files for all peptides and structures in the dataset exist.
Source code in hdxms_datasets/verification.py
38 39 40 41 42 43 44 45 46 47 48 |
|
residue_df_from_peptides(peptides)
Create a dataframe from the peptides with resi, resn_TLA
Source code in hdxms_datasets/verification.py
85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 |
|
residue_df_from_structure(structure)
Create a dataframe from the structure with chain, resi, resn_TLA
Source code in hdxms_datasets/verification.py
51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 |
|
summarize_compare_table(df)
Derive the metrics from the merged table.
Source code in hdxms_datasets/verification.py
156 157 158 159 160 161 162 163 164 165 166 167 168 |
|
verify_dataset(dataset)
Verify the integrity of the dataset by checking sequences and data files.
Source code in hdxms_datasets/verification.py
10 11 12 13 14 15 16 17 |
|
verify_peptides(dataset)
Verify that all peptide sequences match the protein sequence in the dataset states.
Source code in hdxms_datasets/verification.py
20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 |
|