GithubHelp home page GithubHelp logo

deib-geco / gmql-web Goto Github PK

View Code? Open in Web Editor NEW
5.0 10.0 3.0 3.61 MB

GMQL WEB Interface

Home Page: http://www.bioinformatics.deib.polimi.it/geco/?home

License: Apache License 2.0

Scala 1.04% CoffeeScript 0.63% CSS 1.84% HTML 0.36% JavaScript 96.13% Shell 0.01%
gmql gdms gdm genomics genomics-visualization gmql-web

gmql-web's Introduction

GMQL-WEB Build Status

GMQL-WEB project is designed and implemented for make GMQL project publicly available and easy to use by biologists and bioinformaticians.

Please visit GMQL-WEB wiki page for further information.

gmql-web's People

Contributors

acanakoglu avatar akaitoua avatar andreagulino avatar lucananni93 avatar marcomass avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

gmql-web's Issues

User query management

Make possible for any user to manage its queries, by deleting those not needed anymore, and saving on file on user laptop those to be backed-up (both for guest and registered users)

Datasets schema for upload

In uploadSampleUrls when it's set a schema type for the dataset, the system still expects the request to contain a schema file (error 406: Not Acceptable)

Dataset time stamp

We need to have associated to each dataset the date of its creation.
This should be stored somewhere (possibly in the xml structure where Andrea store the dataset profiling information) at the time of the dataset creation, and then shown in the web interface (using a web service created for this aim).

Title of metadata window

Now, clicking on a sample in the repository, you enabled the possibility to view also the sample metadata in a popup window.
That very nice. Yet, the title of such window is "Region data of ..."; it has to be changed to "Metadata of ..." (probably a copy past error).

Visit number in home page

In the GMQL web home page add:

  • the following sentence "This system is under active development, please forgive us for possible errors and send us your comments, criticisms and congratulations, if any."

  • the number of visit, as collected in Google Analytic

UCSC Genome Browser connection

Fix issues in visualizing datasets in UCSC Genome Browser, by adding unique ID-value pair in each "group" attribute (more test on this aspect should be done)

IGB connection

Enable connection to IGB by 1) exposing the repository for IGB "Quick load" (first public data, then possibly also private data of “upgraded” registered user); 2) establishing port connection to a local open instance of IGB in order to directly visualize results in it as in UCSC Genome Browser

Download timeout

Download service as zip file is starting very slowly and this leads to timeout in the network connection.

metadata window - column visibility persistence

In the new metadata window, after closing the window (maybe due just to check which was the query ...), column visibility selection is lost.
Can it be made persistent till the dataset is changed?

Schema name for private datasets

Correctly report the Schema name for private datasets as for public ones (for private ones now it is reported "DatasetName_SCHEMAS")

"Add" dataset window - conformity check of uploaded samples

During data upload check sample consistency vs. dataset schema; complete loading only if all lines of all samples of the uploaded dataset are compliant, whereas stop upload at the first line not compliant, and report the error, the line and sample where it occurs.

Rational and details as follows:
When a new dataset is created with samples that have less region attributes than those listed in the selected dataset schema, currently uploading ends with success, but then the processing of such dataset ends (successfully) with empty result!
If one or more of the uploaded samples include more attributes than those listed in the selected schema, currently the system manages this situation by loading only the values of a number of the first/leftmost sample attributes equal to the number of attributes listed in the schema; this is the only thing that can be done, but it is right only when the additional sample attributes are the last, i.e. the rightmost, ones in the sample.
Check correctness of selected schema for the uploaded samples while creating the new dataset, and accordingly provide a message to the user (either if any of their lines contain less or more region than expected), and stop loading, so that to avoid not conformant samples in the repository.

Metadata browser - enhaced support for memorizing visualization choices

In Metadata browser, make possible changing:
- Dataset without loosing all defined attribute-value predicates
- Any (not only the last one) selected attribute and/or value
- Now it is not possible to download a dataset different from the one selected for Metadata browser without loosing all the defined attribute-value predicates in the Metadata browser

Datasets info

Extend XML schema file adding new attributes for dataset content description.

It is important to associate with each datasets some info about it and the data it contains (e.g., type of contained data, where retrieved from, creation(update date, number of samples included, amount of bytes, etc. ...
Some of these information can be provided directly by the GMQL-Importer software, other should be provided through the solving of issue #2 and #28.
In the xml schema file of GMQL datasets it is important to define an" information" tag which includes several subtags, one for each info to provide (see above examples).
This implies changing the xsd associated with the xml schema and the implementation of both the web interface (in order to make this information accessible and display it properly) and the implementation, in order to parse the xml file and make the information available to the web interface.
It is important to make such changes to the implementation so that all subtags included in the information tag are parsed and displayed in the web interface, so that if new info are required to be displayed, it will be then enough to add them in the xml (in a proper subtag under the information tag) and change the xsd without any further change to the implementation.

Cleaned semplified execution log shown to the user

To avoid giving a bad impression to the reviewers of our work, the execution log shown in the web interface should not include the current several lines of warnings which "appear as possible errors".
Such warnings can be useful, particularly for the developers, but not for the final web user.

I would provide to web user only a shorter log, including only INFO, ERROR, and others lines, and possibly a link to the current full log (including also WARN lines).

This can be done by generating in the core GMQL another log file with only the log lines tagged as INFO, ERROR, and others (by replicating what has been done for generating the current log files, without including the lines tagged as WARN).

The new log file will be shown in the web interface in place of the current one, together with a link to show, only if/when required, the current full log file in another window for more insights (having still the full log is useful when some issue occurs, in order to understand and fix it more easily/quickly).

Document interface feature clearly and intuively

Inform the user about the interface available features, e.g. about GMQL statement autocomplete in query editor, or about how to search in the window; this can be done, for example, by adding a note below the "Query editor" title, such as " *: Press CTRL + spacebar for GMQL command auto-complete support", or a note such as "Press CTRL + f key for text search", respectively, or with a better solution to be thought.

Killed job message

When a job is killed, show “EXEC_STOPPED” instead of “EXEC_FAILED”

'enhancer' annotation sample

What's the issue in the 'enhancer' annotation sample which prevents its materialization when selected?

Running:
DATA_SET_VAR = SELECT(annotation_type == 'enhancer') HG19_BED_ANNOTATION;
MATERIALIZE DATA_SET_VAR INTO RESULT_DS;
the generated datest is empty (both in GTF and tab format)

User category management

Enable a user to request for an upgraded user category; consequently, an email should be sent to the user administrators listed (and easily updatable) in the system (but not publicly visible), and any of such administrator should be able to decide to accept or not the request and reply with a message to the user.

“Add” dataset window improvements (multiple subissues)

In the “Add” window for uploading personal datasets:

  • In the window title, change text “data set” to “dataset”

  • When an error occurs (e.g. a new dataset is created with the name of a data existing in the repository Private section), report why (now only “error” is reported); @akaitoua I thik this requires Abdulrahman fix

[previous additional issues moved to issue #48 and #49]

Metadata browser - NOT attribute

Add possibility to specify in Metadata browser (and then in the GMQL select) the request that the searched samples have null value for a specific metadata attribute (i.e. that they do not have any value for that attribute, that is they do not have that metadata attribute, as it is already possible in the GMQL select by specifying "NOT(attribute == '*')").

Encoding

When uploading dataset samples some character encoding seams not working properly; in fact, when downloading metadata file some characters are not recognize in the local encoding system (e.g. ANSI - Latin1) resulting in some unreadable string (e.g. 17��-estradiol); can the encoding info be embedded in the uploaded/downloaded file?

job log execution time

  • In User jobs and Query logjob windows, if Elapsed time / Execution time is more than 60, express it in "minutes, seconds"; if it more than 3600, express it in "hours, minutes, seconds"

Dataset quantitative features

Provide information about the size (in byte), number of samples and the number of regions in a dataset (and the number of regions in a sample), besides the date of each dataset

metadata window - all colum disabling

In the new metadata window, for the column visibility, is it possible to add the option of disabling all columns?

This would be very useful when there are many columns and you want to view only very few (the most typical case).

Compile_success (compile_failed) button action

When first a job is run, then after it is finished a new query is compiled, pressing the Compile_success (compile_failed) button open the log window with as content the log of the previous job run.

Query editor window size button

Add a button to enlarge/shorten the Query editor window also at the bottom of the window (not only on top); it is very useful in case of long text queries.

GMQL keywords (color identified)

Add as keywords of the language, to be shown in different color in the query editor window, the following ones:
DIST, DG, DL BAGD
LEFT_DISTINCT, RIGHT_DISTINCT, BOTH

Datasets area horizontally resizable

Make the Datasets area horizontally resizable (this can help when datasets have long names).
start adding a button as in query editor to enlarge the Datasets area

User category and resource management (multiple subissues)

  • Define user categories (base/intermediate/advanced/admin) and manage the resources (disk space and computational power) assigned to each user category / specific user (this is very important for the correct use of the Cineca cluster)

  • Implement a limitation on resource usage for users who are not in the upgraded category (to be evaluated the differentiation of resource usage limitation between public and registered user, but not in the upgraded category)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.