disgenet package
Submodules
disgenet.da_disgenet module
Module containing the BioBBs DisGeNET class and the command line interface.
- class disgenet.da_disgenet.DADisgenet(output_file_path, retrieve_by=None, properties=None, **kwargs)[source]
Bases:
BiobbObjectbiobb_disgenet Disease Attribute DisgenetThis class is for downloading a Disease Attribute request from DisGeNET database.Wrapper for the DisGeNET database https://www.disgenet.org and the DisGeNET REST API ´https://www.disgenet.org/api/´ for downloading available collections of genes and variants associated data to human diseases.- Parameters
retrieve_by (str) (Optional) – Configuration params to pass for the retrieval of the association on the REST API, not mandatory, in default is obtained by gene ID (uniprot_entry, source)
output_file_path (str) – Path to the output file, that can be in format TSV, JSON or XML.
properties (dict - Python dict containing the properties for the API interrogation, considering also the credentials of the user to the database) –
diseaseName (str) - (None) Disease name recognized by the database.
disease_id (str) - (None) Disease id or a list of disease ids separated by commas.
source (str) - (“ALL”) Source of the associations (CURATED, INFERRED, ANIMAL_MODELS, ALL, BEFREE, CGI, CLINGEN, CLINVAR, CTD_human, CTD_mouse, CTD_rat, GENOMICS_ENGLAND, GWASCAT, GWASDB, HPO, LHGDN, MGD, ORPHANET, PSYGENET, RGD, UNIPROT).
max_dsi (str) - (None) Max value of DSI range for the gene.
min_dsi (str) - (None) Min value of DSI range for the gene.
min_dpi (str) - (None) Min value of DPI range for the gene.
max_dpi (str) - (None) Max value of DPI range for the gene.
max_pli (str) - (None) Max value of pLI range for the gene.
min_pli (str) - (None) Min value of pLI range for the gene.
format (str) - (“json”) Format output file.
limit (str) - (None) Number of disease to retrieve.
remove_tmp (bool) - (True) [WF property] Remove temporal files.
restart (bool) - (False) [WF property] Do not execute if output files exist.
container_path (str) - (None) Path to the binary executable of your container.
container_image (str) - (None) Container Image identifier.
container_volume_path (str) - (“/data”) Path to an internal directory in the container.
container_working_dir (str) - (None) Path to the internal CWD in the container.
container_user_id (str) - (None) User number id to be mapped inside the container.
container_shell_path (str) - (“/bin/bash”) Path to the binary executable of the container shell.
Examples
This is a use example of how to use the building block from Python:
from biobb_disgenet.disgenet.DADisgenet import DADisgenet
- prop = {
‘disease_id’:’disease_id’, ‘source’: ‘source’, ‘min_dsi’: ‘min_dsi’, ‘max_dsi’: ‘max_dsi’, ‘min_dpi’: ‘min_dpi’, ‘max_dpi’: ‘max_dpi’, ‘min_pli’: ‘min_pli’, ‘max_pli’:’max_pli’, ‘format’: ‘format’, ‘limit’: ‘limit’
} DADisgenet(
retrieve_by=’source’, output_file_path=’/path/to/associationsFile’, properties=prop)
- Info:
- retrieve_by can be:
disease, diseaseName, source, mappings, similarity
- launch() int[source]
Execute the
DADisgenetobject.
- disgenet.da_disgenet.da_disgenet(output_file_path: str, retrieve_by: Optional[str] = None, properties: Optional[dict] = None, **kwargs) int[source]
Create
DADisgenetclass and execute thelaunch()method.
disgenet.dda_disgenet module
Module containing the BioBBs DisGeNET class and the command line interface.
- class disgenet.dda_disgenet.DDADisgenet(retrieve_by, output_file_path, properties=None, **kwargs)[source]
Bases:
BiobbObjectbiobb_disgenet Disease Association DisgenetThis class is for downloading a Disease Disease Associations file from DisGeNET database.Wrapper for the DisGeNET database https://www.disgenet.org and the DisGeNET REST API ´https://www.disgenet.org/api/´ for downloading available collections of genes and variants associated data to human diseases.- Parameters
shared_by (str) – Configuration params to pass for the retrieval of the association on the REST API (gene, uniprot_entry, disease, source, evidences_gene, evidences_disease)
output_file_path (str) – Path to the output file, that can be in format TSV, JSON or XML.
properties (dict - Python dict containing the properties for the API interrogation, considering also the credentials of the user to the database) –
source (str) - (“ALL”) Source of the associations (CURATED, INFERRED, ANIMAL_MODELS, ALL, BEFREE, CGI, CLINGEN, CLINVAR, CTD_human, CTD_mouse, CTD_rat, GENOMICS_ENGLAND, GWASCAT, GWASDB, HPO, LHGDN, MGD, ORPHANET, PSYGENET, RGD, UNIPROT).
disease_vocabulary (str) - Disease vocabulary (icd9cm, icd10, mesh, omim, do, efo, nci, hpo, mondo, ordo).
pvalue (str) - (None) Pvalue of the disease-disease score range.
format (str) - (“json”) Format output file.
limit (str) - (“10”) Number of disease to retrieve.
remove_tmp (bool) - (True) [WF property] Remove temporal files.
restart (bool) - (False) [WF property] Do not execute if output files exist.
container_path (str) - (None) Path to the binary executable of your container.
container_image (str) - (None) Container Image identifier.
container_volume_path (str) - (“/data”) Path to an internal directory in the container.
container_working_dir (str) - (None) Path to the internal CWD in the container.
container_user_id (str) - (None) User number id to be mapped inside the container.
container_shell_path (str) - (“/bin/bash”) Path to the binary executable of the container shell.
Examples
This is a use example of how to use the building block from Python:
from biobb_disgenet.disgenet.dda_disgenet import dda_disgenet
- prop = {
‘disease_id’: ‘C0002395’, ‘source’: ‘source’, ‘pvalue’:’pvalue’, ‘vocabulary’:’vocabulary’, ‘format’: ‘format’, ‘limit’: ‘limit’
} dda_disgenet(shared_by=’genes’,
output_file_path=’/path/to/associationsFile’, properties=prop)
Info:
- launch() int[source]
Execute the
DDADisgenetobject.
- disgenet.dda_disgenet.dda_disgenet(retrieve_by: str, output_file_path: str, properties: Optional[dict] = None, **kwargs) int[source]
Create
DDADisgenetclass and execute thelaunch()method.
disgenet.ga_disgenet module
Module containing the BioBBs DisGeNET class and the command line interface.
- class disgenet.ga_disgenet.GADisgenet(output_file_path, retrieve_by=None, properties=None, **kwargs)[source]
Bases:
BiobbObjectbiobb_disgenet Gene Attribute DisgenetThis class is for downloading a Gene Attribute request from DisGeNET database.Wrapper for the DisGeNET database https://www.disgenet.org and the DisGeNET REST API ´https://www.disgenet.org/api/´ for downloading available collections of genes and variants associated data to human diseases.- Parameters
retrieve_by (str) – Configuration params to pass for the retrieval of the association on the REST API (gene, uniprot_entry, disease, source, evidences_gene, evidences_disease)
output_file_path (str) – Path to the output file, that can be in format TSV, JSON or XML.
properties (dict - Python dict containing the properties for the API interrogation, considering also the credentials of the user to the database) –
source (str) - (“ALL”) Source of the associations (CURATED, INFERRED, ANIMAL_MODELS, ALL, BEFREE, CGI, CLINGEN, CLINVAR, CTD_human, CTD_mouse, CTD_rat, GENOMICS_ENGLAND, GWASCAT, GWASDB, HPO, LHGDN, MGD, ORPHANET, PSYGENET, RGD, UNIPROT).
max_dsi (str) - (None) Max value of DSI range for the gene.
min_dsi (str) - (None) Min value of DSI range for the gene.
min_dpi (str) - (None) Min value of DPI range for the gene.
max_dpi (str) - (None) Max value of DPI range for the gene.
max_pli (str) - (None) Max value of pLI range for the gene.
min_pli (str) - (None) Min value of pLI range for the gene.
format (str) - (“json”) Format output file.
limit (str) - (None) Number of disease to retrieve.
remove_tmp (bool) - (True) [WF property] Remove temporal files.
restart (bool) - (False) [WF property] Do not execute if output files exist.
container_path (str) - (None) Path to the binary executable of your container.
container_image (str) - (None) Container Image identifier.
container_volume_path (str) - (“/data”) Path to an internal directory in the container.
container_working_dir (str) - (None) Path to the internal CWD in the container.
container_user_id (str) - (None) User number id to be mapped inside the container.
container_shell_path (str) - (“/bin/bash”) Path to the binary executable of the container shell.
Examples
This is a use example of how to use the building block from Python:
from biobb_disgenet.disgenet.ga_disgenet import ga_disgenet
- prop = {
‘gene_id’: ‘351’, ‘source’: ‘source’, ‘min_dsi’: ‘min_dsi’, ‘max_dsi’: ‘max_dsi’, ‘min_dpi’: ‘min_dpi’, ‘max_dpi’: ‘max_dpi’, ‘min_pli’: ‘min_pli’, ‘max_pli’:’max_pli’, ‘format’: ‘format’, ‘limit’: ‘limit’
} ga_disgenet(
retrieve_by=’gene’ output_file_path=’/path/to/associationsFile’, properties=prop)
- Info:
- retrieve_by can be:
gene, uniprot, source
- launch() int[source]
Execute the
GADisgenetobject.
- disgenet.ga_disgenet.ga_disgenet(output_file_path: str, retrieve_by: Optional[str] = None, properties: Optional[dict] = None, **kwargs) int[source]
Create
GADisgenetclass and execute thelaunch()method.
disgenet.gda_disgenet module
Module containing the BioBBs DisGeNET class and the command line interface.
- class disgenet.gda_disgenet.GDADisgenet(retrieve_by, output_file_path, properties=None, **kwargs)[source]
Bases:
BiobbObjectbiobb_disgenet Gene Disease Association DisgenetThis class is for downloading a Gene Disease Associations file from DisGeNET database.Wrapper for the DisGeNET database https://www.disgenet.org and the DisGeNET REST API ´https://www.disgenet.org/api/´ for downloading available collections of genes and variants associated data to human diseases.- Parameters
retrieve_by (str) – Configuration params to pass for the retrieval of the association on the REST API (gene, uniprot_entry, disease, source, evidences_gene, evidences_disease)
output_file_path (str) – Path to the output file, that can be in format TSV, JSON or XML.
properties (dict - Python dict containing the properties for the API interrogation, considering also the credentials of the user to the database) –
gene_id (str) - Number identification for a gene or a list of genes separated by commas recognized by the database.
disease (str) - Disease id or a list of disease separated by commas.
disease_type (str) - Disease, phenotype, group.
source (str) - Source of the associations (CURATED, INFERRED, ANIMAL_MODELS, ALL, BEFREE, CGI, CLINGEN, CLINVAR, CTD_human, CTD_mouse, CTD_rat, GENOMICS_ENGLAND, GWASCAT, GWASDB, HPO, LHGDN, MGD, ORPHANET, PSYGENET, RGD, UNIPROT).
disease_vocabulary (str) - Disease vocabulary (icd9cm, icd10, mesh, omim, do, efo, nci, hpo, mondo, ordo).
disease_class * - MeSh disease classes.
min_score (str) - Min value of the gene-disease score range.
max_score (str) - Max value of the gene-disease score range.
min_ei (str) - Min value of evidence index score range.
max_ei (str) - Max value of evidence index score range.
max_dsi (str) - Max value of DSI range for the gene.
min_dsi (str) - Min value of DSI range for the gene.
max_pli (str) - Max value of pLI range for the gene.
min_pli (str) - Min value of pLI range for the gene.
format (str) - Format output file.
limit (str) - Number of GDAs to retrieve.
min_year (str) - The year of the earliest publications.
max_year (str) - The year of the latest publicatons.
offset (str) - Starting offset of the page.
remove_tmp (bool) - (True) [WF property] Remove temporal files.
restart (bool) - (False) [WF property] Do not execute if output files exist.
Examples
This is a use example of how to use the building block from Python:
from biobb_disgenet.disgenet.gda_disgenet import gda_disgenet
- prop = {
‘gene_id’: ‘gene_id’, ‘disease_id’: ‘disease_id’, ‘uniprot_id’: ‘uniprot_id’, ‘source’: ‘source’, ‘min_score’: ‘min_score’, ‘max_score’: ‘max_score’, ‘min_ei’: ‘min_ei’, ‘max_ei’: ‘max_ei’, ‘type’: ‘disease_type’, ‘disease_class’: ‘disease_class’, ‘min_dsi’: ‘min_dsi’, ‘max_dsi’: ‘max_dsi’, ‘min_dpi’: ‘min_dpi’, ‘max_dpi’: ‘max_dpi’, ‘min_pli’: ‘min_pli’, ‘max_pli’:’max_pli’, ‘format’: ‘format’, ‘limit’: ‘limit’, ‘min_year’:’min_year’, ‘max_year’:’max_year’, ‘offset’:’offset’
} gda_disgenet(retrieve_by=’gene’,
output_file_path=’output_gene_351’, properties=prop)
Info:
- retrieve_by can be:
gene, uniprot, source, evidences
- launch() int[source]
Execute the
GDADisgenetobject.
- disgenet.gda_disgenet.gda_disgenet(retrieve_by: str, output_file_path: str, properties: Optional[dict] = None, **kwargs) int[source]
Create
GDADisgenetclass and execute thelaunch()method.
disgenet.va_disgenet module
Module containing the BioBBs DisGeNET class and the command line interface.
- class disgenet.va_disgenet.VADisgenet(output_file_path, retrieve_by=None, properties=None, **kwargs)[source]
Bases:
BiobbObjectbiobb_disgenet Variant Attribute DisgenetThis class is for downloading a Variant Attribute request from DisGeNET database.Wrapper for the DisGeNET database https://www.disgenet.org and the DisGeNET REST API ´https://www.disgenet.org/api/´ for downloading available collections of genes and variants associated data to human diseases.- Parameters
retrieve_by (str) – Configuration params to pass for the retrieval of the association on the REST API, not mandatory, in default is obtained by gene ID (uniprot_entry, source)
output_file_path (str) – Path to the output file, that can be in format TSV, JSON or XML.
properties (dict - Python dict containing the properties for the API interrogation, considering also the credentials of the user to the database) –
gene_id (str) - Number identification for a gene or a list of genes separated by commas recognized by the database.
variant_id (str) - Uniprot id or a list of variant ids separated by commas.
source (str) - Source of the associations (CURATED, INFERRED, ANIMAL_MODELS, ALL, BEFREE, CGI, CLINGEN, CLINVAR, CTD_human, CTD_mouse, CTD_rat, GENOMICS_ENGLAND, GWASCAT, GWASDB, HPO, LHGDN, MGD, ORPHANET, PSYGENET, RGD, UNIPROT).
max_dsi (str) - Max value of DSI range for the gene.
min_dsi (str) - Min value of DSI range for the gene.
min_dpi (str) - Min value of DPI range for the gene.
max_dpi (str) - Max value of DPI range for the gene.
max_pli (str) - Max value of pLI range for the gene.
min_pli (str) - Min value of pLI range for the gene.
format (str) - Format output file.
limit (str) - Number of disease to retrieve.
remove_tmp (bool) - (True) [WF property] Remove temporal files.
restart (bool) - (False) [WF property] Do not execute if output files exist.
Examples
This is a use example of how to use the building block from Python:
from biobb_disgenet.disgenet.ga_disgenet import ga_disgenet
- prop = {
‘variant_id’: ‘variant_id’, ‘source’: ‘source’, ‘min_dsi’: ‘min_dsi’, ‘max_dsi’: ‘max_dsi’, ‘min_dpi’: ‘min_dpi’, ‘max_dpi’: ‘max_dpi’, ‘min_pli’: ‘min_pli’, ‘max_pli’:’max_pli’, ‘format’: ‘format’, ‘limit’: ‘limit’
} va_disgenet(
retrieve_by=”variant”, output_file_path=’/path/to/associationsFile’, properties=prop)
Info:
- launch() int[source]
Execute the
VADisgenetobject.
- disgenet.va_disgenet.main()[source]
Command line execution of this building block. Please check the command line documentation.
- disgenet.va_disgenet.va_disgenet(output_file_path: str, retrieve_by: Optional[str] = None, properties: Optional[dict] = None, **kwargs) int[source]
Create
VADisgenetclass and execute thelaunch()method.
disgenet.vda_disgenet module
Module containing the BioBBs DisGeNET class and the command line interface.
- class disgenet.vda_disgenet.VDADisgenet(retrieve_by, output_file_path, properties=None, **kwargs)[source]
Bases:
BiobbObjectbiobb_disgenet Variant Association DisgenetThis class is for downloading a Varinat Disease Associations file from DisGeNET database.Wrapper for the DisGeNET database https://www.disgenet.org and the DisGeNET REST API ´https://www.disgenet.org/api/´ for downloading available collections of genes and variants associated data to human diseases.- Parameters
retrieve_by (str) – Configuration params to pass for the retrieval of the association on the REST API (variant, disease, source)
output_file_path (str) – Path to the output file, that can be in format TSV, JSON or XML.
properties (dict - Python dict containing the properties for the API interrogation, considering also the credentials of the user to the database) –
gene_id (str) - Number identification for a gene or a list of genes separated by commas recognized by the database.
variant_id (*str) - Variant id for the gene or a list of variant ids separated by commas recognized by the database.
disease_id (str) - Disease id or a list of disease separated by commas.
type (str) - Disease, phenotype, group.
source (str) - Source of the associations (CURATED, INFERRED, ANIMAL_MODELS, ALL, BEFREE, CGI, CLINGEN, CLINVAR, CTD_human, CTD_mouse, CTD_rat, GENOMICS_ENGLAND, GWASCAT, GWASDB, HPO, LHGDN, MGD, ORPHANET, PSYGENET, RGD, UNIPROT).
disease_class * - MeSh disease classes.
min_score (str) - Min value of the gene-disease score range.
max_score (str) - Max value of the gene-disease score range.
min_ei (str) - Min value of evidence index score range.
max_ei (str) - Max value of evidence index score range.
max_dsi (str) - Max value of DSI range for the gene.
min_dsi (str) - Min value of DSI range for the gene.
max_pli (str) - Max value of pLI range for the gene.
min_pli (str) - Min value of pLI range for the gene.
format (str) - Format output file.
limit (str) - Number of GDAs to retrieve.
restart (bool) - (False) [WF property] Do not execute if output files exist.
Examples
This is a use example of how to use the building block from Python:
from biobb_disgenet.disgenet.vda_disgenet import vda_disgenet
- prop = {
‘variant_id’: ‘variant_id’, ‘disease_id’: ‘disease_id’, ‘source’: ‘source’, ‘min_score’: ‘min_score’, ‘max_score’: ‘max_score’, ‘min_ei’: ‘min_ei’, ‘max_ei’: ‘max_ei’, ‘type’: ‘disease_type’, ‘disease_class’: ‘disease_class’, ‘min_dsi’: ‘min_dsi’, ‘max_dsi’: ‘max_dsi’, ‘min_dpi’: ‘min_dpi’, ‘max_dpi’: ‘max_dpi’, ‘min_pli’: ‘min_pli’, ‘max_pli’:’max_pli’, ‘format’: ‘format’, ‘limit’: ‘limit’ ‘min_year’:’min_year’, ‘max_year’:’max_year’, ‘offset’:’offset’
} vda_disgenet(retrieve_by=’variant’,
output_file_path=’/path/to/associationsFile’, properties=prop)
- Info:
- retrieve_by can be:
variant, gene, disease, source, evidences
- launch() int[source]
Execute the
VDADisgenetobject.
- disgenet.vda_disgenet.main()[source]
Command line execution of this building block. Please check the command line documentation.
- disgenet.vda_disgenet.vda_disgenet(retrieve_by: str, output_file_path: str, properties: Optional[dict] = None, **kwargs) int[source]
Create
VDADisgenetclass and execute thelaunch()method.