Týden 3 - UCSC a Ensembl Genome Browser - programovatelne dotazy a zber dat
Jednoduchý způsob interakce se službou Biomart bez potřeby R je přes http protokol:
curl -d@query.xml http://www.biomart.org/biomart/martservice/result
kde query.xml obsahuje dotaz (lze vygenerovat i na webu biomartu), např.:
query=xml version="1.0" encoding="UTF-8"?>
DOCTYPE Query>
<Query virtualSchemaName = "default" formatter = "TSV" header = "0" uniqueRows = "0" count = "" datasetConfigVersion = "0.6" >
<Dataset name = "hsapiens_gene_ensembl" interface = "default" >
<Filter name = "chromosome_name" value = "1"/>
<Filter name = "with_illumina_humanwg_6_v3" excluded = "0"/>
<Attribute name = "ensembl_gene_id" />
<Attribute name = "ensembl_transcript_id" />
</Dataset>
Query>
Hledani atributu (nebo filtru):
listAttributes(ensembl)[grep("go",listAttributes(ensembl)$name),]