randomInteractome.Rd
Retrieve molecular interactions for the random set of proteins (of a particular taxon)
randomInteractome(MITABdata = NULL, degree_data = NULL, n_prot, degree_dist = NULL, ...)
degree_data | data.table containing pre-calculated (using |
---|---|
n_prot | integer (1L), the number of proteins for which to retrieve the random set of interactions |
degree_dist | data.table, specifies the degree frequency () for each degree (N) to produce the network with the specific degree distribution, if set to NULL (default) the degree distribution will correspond to that of |
... | Arguments passed on to
|
list of two elements: 1. interactome - data.table containing molecular interaction data in either of these two formats:
if clean
is TRUE: contains columns as described in cleanMITAB
;
if clean
is FALSE: contains a standard set of columns for MITAB2.5 or MITAB2.7 depending on format
;
2. seed - character vector containing IDs of proteins used as a seed to retrieve molecular interactions
Not implemented: Random network can be specified to have specific degree distribution. If the (degree
parameter is set taxid
proteins will be split by degree and from each degree group a sample of the size specified by how many times specific degree number is repeated in degree
will be taken.
If the degree distribution is not specified a sample of n_prot
is taken from all proteins which have interaction data available in the database
for taxid
. In this case, the degree distribution of the resulting set of proteins will be similar to the degree distribution in the interactome of taxid
in database
.
randomInteractome
retrieves molecular interactions using fullInteractome
{ # retrive the interactome using PSICQIUC servise (or by reading local copy) from IntAct databases for a list of 200 random human (9606) proteins, not specifying their degree distribution #set.seed(1) # commented because too slow #random = randomInteractome(n_prot = 200, degree_dist = NULL, taxid = "9606", database = "imex", protein_only = TRUE) # retrive the interactome from MITABdata for a list of 200 random human (9606) proteins, not specifying their degree_dist distribution full = fullInteractome(taxid = "9606", database = "IntActFTP", format = "tab25", clean = TRUE, protein_only = TRUE) set.seed(1) random = randomInteractome(MITABdata = full, n_prot = 200, degree_dist = NULL) # retrive the interactome using PSICQIUC servise (or by reading local copy from a specified directory) from IntAct databases for a list of 200 random human (9606) proteins, not specifying their degree_dist distribution #set.seed(1) #random = randomInteractome(n_prot = 200, degree_dist = NULL, taxid = "9606", database = "IntActFTP", protein_only = TRUE, directory = "./data/") }#>#>#>#> Warning: Found and resolved improper quoting out-of-sample. First healed line 25383: <<uniprotkb:P16054 uniprotkb:Q05769 intact:EBI-298451 intact:EBI-298933|uniprotkb:Q543K3 psi-mi:kpce_mouse(display_long)|uniprotkb:Prkce(gene name)|psi-mi:Prkce(display_short)|uniprotkb:Pkce(gene name synonym)|uniprotkb:Pkcea(gene name synonym)|uniprotkb:nPKC-epsilon(gene name synonym) psi-mi:pgh2_mouse(display_long)|uniprotkb:Ptgs2(gene name)|psi-mi:Ptgs2(display_short)|uniprotkb:Cox-2(gene name synonym)|uniprotkb:Cox2(gene name synonym)|uniprotkb:Pghs-b(gene name synonym)|uniprotkb:Tis10(gene na>>. If the fields are not quoted (e.g. field separator does not appear within any field), try quote="" to avoid this warning.