Hey, i am using HOME-timeseries with 23 Samples (no replicates) and i just noticed tha

All Comb1-n "NA" for most DMRs in CHH about home HOT 8 OPEN

listerlab commented on July 19, 2024

All Comb1-n "NA" for most DMRs in CHH

from home.

Comments (8)

Akanksha2511 commented on July 19, 2024

Hi,

Sorry for the delay in reply. NA implies that the status (hyper/hypo) of the DMRs could not be interpreted. I would suggest visualisation of few DMRs. But, if all the status is "NA" I would say it does not have enough power to identify the DMRs with confidence.

Thanks,
Akanksha

from home.

mkpython3 commented on July 19, 2024

Hey, thank you for your response.

So do I understand it correctly: HOME first discovers a potential DMR in that region. But then it does not find enough confidence to report it. The DMR will be written to the output file anyways but with all columns NA.
I just think it is a little weird since it only happened in the CHH context and a confidence score is still reported. Maybe the DMR should not appear in the output at all then?

Best regards
Marius

from home.

Akanksha2511 commented on July 19, 2024

Hi Marius,

Yes, so for time series data, HOME will try to find the difference between all the samples and will report the region of difference and its confidence (delta). It then checks for a consistent delta but is same direction (hyper/hypo) for possible pairwise comparisons. If its not able to find that it will report NA. You are right its a bit weird to get 'NAs' in all the samples. I guess its just telling you that the data does not have enough coverage for CHH methylation identification. Have to tried to visualise these DMRs on IGV or UCSC browser?

Thanks,
Akanksha

from home.

mkpython3 commented on July 19, 2024

Hey Akanksha

I have visualized a few DMRs now using a Python script. For some of the "All NA" DMRs it is quite obvious why they are NA because there is rarely any data in the range of the DMR. But for others i think there should be enough data that coverage should not be an issue. I have attached a few plots below. Here the size of the dots represents the coverage and the color (blue to pink) the level of methylation.

These are the plots of DMRs that have >=50% of data in the Comb1-N columns:

These are DMRs where there is 100% NA in every column like in the first post I made, where I think it should be possible to call a DMR:

This is one of the 100% NA DMRs where I think its quite obvious why there is so much NA, but I just think it maybe should not be outputted at all:

Thanks alot for your help!
Marius

from home.

Akanksha2511 commented on July 19, 2024

Hi Marius,

Thanks for sharing the plots. Its hard to visualise it like this. I think it will better to have a IGV or a UCSC plot in the standard format which will also show the direction of methylation. Please have a look at the HOME paper for reference.

Thanks,
Akanksha

from home.

mkpython3 commented on July 19, 2024

Hey Akanksha,

I managed to visualize the methylation levels in IGV. However, I don't know how to reproduce the plots with the methylation difference like in the HOME paper. And btw, what do you mean with "direction of methylation". Do you include an extra track with the DMR information? Because the methylation files are only 0 - 1 and not -1 - 1. Or is there an option in IGV to produce a methylation difference track from the others?

Best regards,
Marius

from home.

Akanksha2511 commented on July 19, 2024

Hi Marius,

You can upload the wig files from BSseeker2 for the samples and visualise the DMRs on IGV. I think this should give us the idea about what's the issue.

Thanks,
Akanksha

from home.

mkpython3 commented on July 19, 2024

Hey Akanksha,

thank you for your response. I am using Bismark for my analysis and merged the + and - strand for all chromosomes, therefore i have bedgraph files that go from 0 to 1 instead of wig files that go from -1 to 1. Anyways it should not make a huge difference. I converted those bedgraph files to .tdf files for igv and made a screenshot of a 100% NA DMR. I hope this helps?

Best regards,
Marius

from home.

All Comb1-n "NA" for most DMRs in CHH about home HOT 8 OPEN

Comments (8)

These are the plots of DMRs that have >=50% of data in the Comb1-N columns:

These are DMRs where there is 100% NA in every column like in the first post I made, where I think it should be possible to call a DMR:

This is one of the 100% NA DMRs where I think its quite obvious why there is so much NA, but I just think it maybe should not be outputted at all:

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs