Comments (15)
Actually, it seems that the only column where you can specify amino acid change must now be specified as: HGVSp_Short. Is this on purpose, or maybe just temporary? @schultzn @n1zea144 @jjgao
from cbioportal.
This is on purpose. HGVSp_Short represents the protein change value coming from the VEP annotator.
We also have another column (Amino_Acid_Change) for custom annotations, but this value has not made its way to the front-end yet.
from cbioportal.
but, what about others who don't have VEP annotated data? (this is true of most other installs). this also doesn't match the format details on the wiki. can we consider a more generic solution?
Sent from mobile phone. Please excuse typos.
-------- Original message --------
From: onursumer [email protected]
Date:05/13/2015 3:18 PM (GMT-05:00)
To: cBioPortal/cbioportal [email protected]
Cc: ecerami [email protected]
Subject: Re: [cbioportal] Mutation file format docs do not mention correct
column for Amino_Acid_Change (#17)
This is on purpose. HGVSp_Short represents the protein change value coming from the VEP annotator.
We also have another column (Amino_Acid_Change) for custom annotations, but this value has not made its way to the front-end yet.
—
Reply to this email directly or view it on GitHub.
from cbioportal.
Perhaps we should support an annotation property in the meta file for mutations? Based on this property the import code does the right thing?
On May 13, 2015, at 5:27 PM, ecerami [email protected] wrote:
but, what about others who don't have VEP annotated data? (this is true of most other installs). this also doesn't match the format details on the wiki. can we consider a more generic solution?
Sent from mobile phone. Please excuse typos.
-------- Original message --------
From: onursumer [email protected]
Date:05/13/2015 3:18 PM (GMT-05:00)
To: cBioPortal/cbioportal [email protected]
Cc: ecerami [email protected]
Subject: Re: [cbioportal] Mutation file format docs do not mention correct column for Amino_Acid_Change (#17)This is on purpose. HGVSp_Short represents the protein change value coming from the VEP annotator.
We also have another column (Amino_Acid_Change) for custom annotations, but this value has not made its way to the front-end yet.
—
Reply to this email directly or view it on GitHub.—
Reply to this email directly or view it on GitHub.
from cbioportal.
A while back (back in the old days!), I remember we had this same problem in TCGA where each sequencing center labeled the amino acid change differently, e.g. AMINO_ACID_CHANGE, AA_CHANGE, PROTEIN_CHANGE, etc. The MAF reader had the "smarts" to figure out which columns were specified, and pick out the right amino acid change column. If we did this, then the code would always work, regardless of MAF variations, and no need to configure anything. Thoughts?
from cbioportal.
I thought are still supporting Protein_Change in the absence of VEP annotations?
On May 14, 2015, at 11:25 AM, ecerami [email protected] wrote:
A while back (back in the old days!), I remember we had this same problem in TCGA where each sequencing center labeled the amino acid change differently, e.g. AMINO_ACID_CHANGE, AA_CHANGE, PROTEIN_CHANGE, etc. The MAF reader had the "smarts" to figure out which columns were specified, and pick out the right amino acid change column. If we did this, then the code would always work, regardless of MAF variations, and no need to configure anything. Thoughts?
—
Reply to this email directly or view it on GitHub.
from cbioportal.
@schultzn: As far as I can tell, only HGVSp_Short is recognized. Hence, if you have a MAF with Protein_Change or AA_Change, mutations just appear as "MUTATED", but have no amino acid change details.
from cbioportal.
I think we can fall back on Amino_Acid_Change if HGVSp_Short does not exist or is empty.
from cbioportal.
Ah, that's true. AMINO_ACID_CHANGE is in the code. But, if it's specified, it is stored in a separate field called aminoAcidChange, and only proteinChange is actually stored in the database.
I am looking at this: https://github.com/cBioPortal/cbioportal/blob/master/core/src/main/java/org/mskcc/cbio/maf/MafUtil.java
record.setProteinChange(TabDelimitedFileUtil.getPartString(getColumnIndex(PROTEIN_CHANGE), parts));
and
public static final String PROTEIN_CHANGE = "HGVSp_Short";
And in ExtendedMutationUtil:
String proteinChange = record.getProteinChange();
// if protein change is not valid, then use the string "MUTATED"
if (!isValidProteinChange(proteinChange))
{
proteinChange = "MUTATED";
}
from cbioportal.
I think if AMINO_ACID_CHANGE column exists in the MAF but not HGVSp_Short column, then we should set the protein change value to AMINO_ACID_CHANGE column value. Do we also need to consider other custom column names to get protein change value, or does AMINO_ACID_CHANGE suffice?
from cbioportal.
Now importing Amino_Acid_Change column with the commit 9a2f1cd
from cbioportal.
Thanks, @onursumer! I will update the wiki later today, and close out...
from cbioportal.
Hi @onursumer. I am trying to find this bug fix. Do you know which branch you committed to? Tx.
from cbioportal.
Somehow it created another tree in master branch, I think I was out of
sync. Just merged and pushed it to the master branch. Let me know if it
works now.
On Mon, May 18, 2015 at 4:54 PM, ecerami [email protected] wrote:
Hi @onursumer https://github.com/onursumer. I am trying to find this
bug fix. Do you know which branch you committed to? Tx.—
Reply to this email directly or view it on GitHub
#17 (comment)
.
from cbioportal.
Hurray! this now works. I have also updated the wiki doc to explain this: https://github.com/cBioPortal/cbioportal/wiki/File-Formats#mutation-data. Closing...
from cbioportal.
Related Issues (20)
- should we import the hg38 seed db by default
- Enable cache for multi-study sessions
- Add Stacked Bar Chart for Generic Assay to Oncoprint
- Enhance roles filtering in OAuth2SecurityConfig.java
- Add reVUE icon for EGFR
- Homepage search improvement
- Show summary stats on homepage
- docs landing page needs 'Data Contributor' role
- update homepage headers & footers and add contact page
- virtual study creation description bug
- data type + reference genome filter on homepage does not work with virtual studies HOT 1
- Add updated ER diagram
- remove feature flag for categorical generic assay HOT 1
- in study view on mouseover of entire row show looking glass icon
- show only specific samples in genomic evolution chart
- New Plots tab in study view is missing the "same gene" feature
- p-value and q-value should show 10^-10
- On plots tab if both axes are patient based attributes, show patient counts
- fix failling api_sync test on master
- follow up fixes pagination PRs HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from cbioportal.