Monday, 29 December 2008

genetics - Mapping a mutation to known SNP, 3' UTR, miR

I've parsed out a very large TCGA cancer ssm (single mutation file) file to give me the essential information.



The ssm is in the following format:



['Gene name', 'Ensembl Gene ID', 'Chromosome', 'Chromosome start', 'Cancer Type']
['NTRK1', 'ENSG00000198400', '1','156849827', 'Prostate Adenocarcinoma (TCGA, US)']


From there I would like to grab each mutation and :



  1. Map the chromosomal position to a known SNP (rs something output).

  2. See if this snp is found in a 3'UTR

  3. See if this snp is found in a miRNA

  4. Missense or sense mutation

  5. Any relevant genbank etc.. ids

I'd like to do this Python (I think BioPython is suited for this) for downstream applications.

No comments:

Post a Comment