utils
check_sequence(peptides, known_sequence, n_term=1)
Check the sequence of peptides against the given sequence.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
peptides
|
DataFrame
|
DataFrame containing peptide information. |
required |
sequence
|
The original sequence to check against. |
required | |
n_term
|
int
|
The number of N-terminal residues to consider. |
1
|
Returns:
Type | Description |
---|---|
list[tuple[int, str, str]]
|
A tuple containing the fixed sequence and a list of mismatches. |
Source code in hdxms_datasets/utils.py
43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 |
|
default_protein_info(peptides)
Generate minimal protein info from a set of peptides
Source code in hdxms_datasets/utils.py
70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 |
|
reconstruct_sequence(peptides, known_sequence, n_term=1)
Reconstruct the sequence form a dataframe of peptides with sequence information. The sequence is reconstructed by replacing the known sequence with the peptide sequences at the specified start and end positions.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
peptides
|
DataFrame
|
DataFrame containing peptide information. |
required |
known_sequence
|
str
|
Starting sequence. Can be a string 'X' as placeholder. |
required |
n_term
|
int
|
The residue number of the N-terminal residue. This is typically 1, can be negative in case of purification tags. |
1
|
Returns:
Type | Description |
---|---|
str
|
The reconstructed sequence. |
Source code in hdxms_datasets/utils.py
14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 |
|