Comments (4)
Hey @juice500ml, thanks a lot for bringing this issue up. My reading of the user agreement is a bit different: 'for other purposes' may mean non-research or commercial purposes, which this is not. Besides, the dataset has been available for a while now via Deep Lake and Hugging Face (here), so I'm not sure it's an issue.
Are you by any chance with LDC? We could potentially show a user agreement before a user accesses the dataset, which is common in such cases.
from deeplake.
Dear @mikayelh, thanks a lot for the quick response!
To my understanding, this clause becomes a problem for redistributing the data.
Unless explicitly permitted herein, User shall not otherwise publish, retransmit, disclose, display, copy, reproduce or redistribute the LDC Databases to others outside of User’s Research Group.
In this case, I think an everyday user of your project would be well outside the definition of "User's Research Group", though I might be wrong.
Also, I believe the Hugging Face distribution you mentioned is this one, right? https://huggingface.co/datasets/timit_asr
In that case, one has to download the data from LDC manually.
My affiliation (CMU) is part of LDC, but I'm not exactly with LDC, so I won't be able to answer those kinds of questions :( Actually, I was also looking for ways to include LDC data in a public project, and that's how I stumbled upon this project.
Got it, @juice500ml! I'll reach out to the contact listed on their website, and we will take down the dataset or add the user agreement if they so desire. I'm not sure about this specific dataset, but a large share of our datasets, including this one, were added long ago, and we filtered out the ones that were restrictive. So unless this agreement was introduced later on, that shouldn't be an issue.
In your specific case, though, it seems like you'd be able to use the dataset via Deep Lake without an issue.
Thanks again for letting us know!
I see, hope everything works out! Thanks a lot!