GithubHelp home page GithubHelp logo

z0on / emapper_to_gomwu_kogmwu Goto Github PK

View Code? Open in Web Editor NEW
5.0 5.0 2.0 3.79 MB

A few awk one-liners to extract data tables compatible with GO_MWU and KOGMWU methods out of eggNOG-eMapper output

emapper_to_gomwu_kogmwu's People

Contributors

z0on avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

emapper_to_gomwu_kogmwu's Issues

KOGMWU error about differing rows

Thanks for creating the KOGMWU package. Not sure if this is the best venue to post a question, but I'm running into the following error about differing rows when doing a KOGMWU analysis:

kog.mwu(temp, ApalmKOG, Alternative = "t")

Warning: NAs introduced by coercionWarning: cannot xtfrm data frames[1] "Continuous measure of interest: will perform MWU test"
[1] "skipping integer(0) nseqs = 1"
Error in data.frame(..., check.names = FALSE) :
arguments imply differing number of rows: 1, 0

6. stop(gettextf("arguments imply differing number of rows: %s", paste(unique(nrows), collapse = ", ")), domain = NA)
5. data.frame(..., check.names = FALSE)
4. cbind(deparse.level, ...)
3. cbind(term = as.character(terms), res)
2. kog.mwut(annotated, Alternative)
1. kog.mwu(temp, ApalmKOG, Alternative = "t")

My input files:

data.txt
gene2kog.txt

Is there a reason why I'm getting this error? From my understanding, the two input data frames can have differing rows. Thanks for your help!

Why not multiple KOG classes?

Question mostly about KOG_MWU: is there a statistical or biologically-relevant reason to exclude genes with multiple KOG annotations, unlike GO_MWU which allows multiple GO annotations per gene? Is it because there isn't the same kind of hierarchy with KOG?

Extracting KOG info and gene names for KOGMWU

Hi,

I want to make comparisons between the KOG annotation info from two different species/datasets using KOG_MWU. Each has eggnog annotation data, but I am having trouble extracting the info with the code you provided. Can you elaborate on what your commands are doing (or an example of the end format of the files needed for KOGMWU)?

The annotation information I have was generated with two different versions of eggnog (species 2 output was reformatted with dammit), so I am wondering if I need to re-run so that the output is similar. I am also noticing that in addition to KOG IDs, I am getting COG and ENOG IDs, but I am not sure what these are, or if they can be used?

I have attached an example of the output for each species for reference. I appreciate any help/advice you may have!

Species1.txt
Species2.txt

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.