Comments (4)

mikayelh commented on September 27, 2024

Hey @juice500ml, thanks a lot for bringing this issue up. My reading of the user agreement is a bit different: 'for other purposes' may mean non-research or commercial purposes, which this is not. Besides, it's been available for a while now via Deep Lake and Hugging Face (here), so I'm not sure it's an issue.

Are you by any chance with LDC? We could potentially show a user agreement before a user accesses the dataset, which is common in such cases.

from deeplake.

juice500ml commented on September 27, 2024

Dear @mikayelh, thanks a lot for the quick response!
As I understand it, the following clause is a problem for redistributing the data:

Unless explicitly permitted herein, User shall not otherwise publish, retransmit, disclose, display, copy, reproduce or redistribute the LDC Databases to others outside of User’s Research Group.

In this case, I think an everyday user of your project would be well outside the definition of "User's Research Group", though I might be wrong.
Also, I believe the Hugging Face distribution you mentioned is this one, right? https://huggingface.co/datasets/timit_asr
In that case, one has to download the data from LDC manually.

My affiliation (CMU) is part of LDC, but I'm not exactly with LDC, so I won't be able to answer those kinds of questions :( Actually, I was looking for ways to include LDC data in a public project as well, and that's how I stumbled upon this project.

mikayelh commented on September 27, 2024

Got it, @juice500ml! I'll reach out to the contact listed on their website, and we will take down the dataset or add the user agreement if they so desire. I'm not sure about this specific dataset, but a large share of our datasets, including this one, were added long ago, and we filtered out the ones that were restrictive. So unless this agreement was introduced later on, that shouldn't be an issue.

In your specific case, though, it seems like you'd be able to use the dataset via Deep Lake without an issue.

Thanks again for letting us know!

juice500ml commented on September 27, 2024

I see, I hope everything works out! Thanks a lot!

