Comments (8)
Hi,
Sorry for the delay in replying. NA implies that the status (hyper/hypo) of the DMRs could not be interpreted. I would suggest visualising a few of the DMRs. But if the status is "NA" for all of them, I would say the data does not have enough power to identify the DMRs with confidence.
Thanks,
Akanksha
from home.
Hey, thank you for your response.
So do I understand it correctly: HOME first discovers a potential DMR in that region, but then it does not find enough confidence to report it. The DMR is written to the output file anyway, but with NA in all columns.
I just think it is a little weird since it only happened in the CHH context and a confidence score is still reported. Maybe the DMR should not appear in the output at all then?
Best regards
Marius
Hi Marius,
Yes, so for time series data, HOME will try to find the difference between all the samples and will report the region of difference and its confidence (delta). It then checks for a consistent delta in the same direction (hyper/hypo) across the possible pairwise comparisons. If it's not able to find that, it will report NA. You are right, it's a bit weird to get NAs in all the samples. I guess it's just telling you that the data does not have enough coverage for CHH methylation identification. Have you tried to visualise these DMRs in IGV or the UCSC browser?
Thanks,
Akanksha
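To make the reply above concrete, here is a minimal sketch of how a per-comparison status could end up as NA. It is not HOME's actual code: the function names, the `min_delta` and `min_sites` thresholds, and the use of a mean per-cytosine difference are all assumptions chosen for illustration. It only shows that a pair of samples gets a direction when enough positions are covered in both samples and the difference is large enough, and NA otherwise.

```python
from itertools import combinations


def pair_status(levels_a, levels_b, min_delta=0.1, min_sites=3):
    """Status of one pairwise comparison: 'hyper', 'hypo' or 'NA'.

    levels_a / levels_b: per-cytosine methylation levels (0..1) in the
    region, with None where the sample has no coverage.
    min_delta and min_sites are hypothetical thresholds, not HOME parameters.
    """
    # Only positions covered in both samples can contribute a difference.
    deltas = [b - a for a, b in zip(levels_a, levels_b)
              if a is not None and b is not None]
    if len(deltas) < min_sites:
        return "NA"  # too little shared coverage to call a direction
    mean_delta = sum(deltas) / len(deltas)
    if abs(mean_delta) < min_delta:
        return "NA"  # difference too small to call a direction
    return "hyper" if mean_delta > 0 else "hypo"


def region_statuses(samples):
    """Status for every pairwise comparison of the samples in one region."""
    return {(i, j): pair_status(samples[i], samples[j])
            for i, j in combinations(range(len(samples)), 2)}


samples = [
    [0.1, 0.2, 0.1, None],    # time point 0
    [0.5, 0.6, 0.4, 0.5],     # time point 1
    [None, None, 0.9, None],  # time point 2, almost no coverage
]
statuses = region_statuses(samples)
```

In this toy region only the comparison between the first two time points has enough shared coverage, so it is the only one that receives a direction. The two comparisons involving the sparsely covered sample come out as NA.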
Hey Akanksha
I have visualized a few DMRs now using a Python script. For some of the "All NA" DMRs it is quite obvious why they are NA, because there is hardly any data in the range of the DMR. But for others I think there should be enough data that coverage should not be an issue. I have attached a few plots below. Here the size of the dots represents the coverage and the color (blue to pink) the level of methylation.
These are the plots of DMRs that have >=50% of data in the Comb1-N columns:
These are DMRs where there is 100% NA in every column like in the first post I made, where I think it should be possible to call a DMR:
This is one of the 100% NA DMRs where I think it's quite obvious why there is so much NA, but I just think it maybe should not be output at all:
Thanks a lot for your help!
Marius
Hi Marius,
Thanks for sharing the plots. It's hard to interpret them like this. I think it would be better to have an IGV or UCSC plot in the standard format, which will also show the direction of methylation. Please have a look at the HOME paper for reference.
Thanks,
Akanksha
Hey Akanksha,
I managed to visualize the methylation levels in IGV. However, I don't know how to reproduce the plots with the methylation difference like in the HOME paper. And by the way, what do you mean by "direction of methylation"? Do you include an extra track with the DMR information? Because the methylation files only range from 0 to 1 and not from -1 to 1. Or is there an option in IGV to produce a methylation difference track from the others?
Best regards,
Marius
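One way to get a difference track from two 0 to 1 methylation tracks is to compute it outside IGV. The following is a minimal sketch, not part of HOME or IGV: `read_bedgraph` and `difference_track` are hypothetical helper names, and it assumes both bedGraph inputs use identical per-cytosine intervals, so that only intervals present in both samples are compared.

```python
def read_bedgraph(lines):
    """Parse bedGraph lines into {(chrom, start, end): value}."""
    values = {}
    for line in lines:
        line = line.strip()
        # Skip blank lines and track/browser/comment header lines.
        if not line or line.startswith(("track", "browser", "#")):
            continue
        chrom, start, end, value = line.split()[:4]
        values[(chrom, int(start), int(end))] = float(value)
    return values


def difference_track(lines_a, lines_b):
    """Return bedGraph lines holding (sample B - sample A) per shared interval.

    The result ranges from -1 to 1: positive values mean B is more
    methylated than A, negative values mean it is less methylated.
    """
    a = read_bedgraph(lines_a)
    b = read_bedgraph(lines_b)
    shared = sorted(a.keys() & b.keys())
    return [f"{c}\t{s}\t{e}\t{b[(c, s, e)] - a[(c, s, e)]:.4f}"
            for c, s, e in shared]


sample_a = ["track type=bedGraph", "chr1\t10\t11\t0.2", "chr1\t20\t21\t0.9"]
sample_b = ["chr1\t10\t11\t0.7", "chr1\t30\t31\t0.5"]
diff = difference_track(sample_a, sample_b)
```

With real data, the two inputs would be open file handles instead of lists, and the returned lines could be written to a new bedGraph file and loaded next to the per-sample tracks. Positions covered in only one sample are dropped rather than treated as zero, so gaps in the difference track reflect missing coverage.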
Hi Marius,
You can upload the wig files from BSseeker2 for the samples and visualise the DMRs in IGV. I think this should give us an idea of what the issue is.
Thanks,
Akanksha
Hey Akanksha,
Thank you for your response. I am using Bismark for my analysis and merged the + and - strands for all chromosomes, therefore I have bedGraph files that go from 0 to 1 instead of wig files that go from -1 to 1. Anyway, it should not make a huge difference. I converted those bedGraph files to .tdf files for IGV and made a screenshot of a 100% NA DMR. I hope this helps.
Best regards,
Marius