Comments (5)
The basic concept here is that the metadata surrounding these voice clips will be kept separate from the voice clips themselves. You can find the voice clip using the metadata, but you should not be able to find the metadata given a voice clip. We will store the clips in the filesystem, and store the metadata (including the voice file path) in the database.
Later, when we publish the voice data, we can include non-PII metadata with the audio clips.
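The split described above could be sketched roughly as follows. This is only an illustration, not the project's implementation: it assumes SQLite as the database, and the table name `clip_metadata`, the helpers `make_db`, `store_clip`, and `find_clip_paths`, and the `.wav` extension are all hypothetical. The file gets a random name, so nothing about the clip on disk points back to its metadata.

```python
import sqlite3
import tempfile
import uuid
from pathlib import Path


def make_db():
    """Create an in-memory database with a hypothetical metadata table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE clip_metadata (clip_path TEXT PRIMARY KEY, gender TEXT)"
    )
    return conn


def store_clip(conn, clip_dir, audio_bytes, gender):
    """Write the audio to the filesystem and record metadata plus path in the db."""
    # A random file name keeps the file itself free of any metadata.
    clip_path = Path(clip_dir) / f"{uuid.uuid4().hex}.wav"
    clip_path.write_bytes(audio_bytes)
    conn.execute(
        "INSERT INTO clip_metadata (clip_path, gender) VALUES (?, ?)",
        (str(clip_path), gender),
    )
    conn.commit()
    return clip_path


def find_clip_paths(conn, gender):
    """Metadata -> clip: look up file paths from a metadata value."""
    rows = conn.execute(
        "SELECT clip_path FROM clip_metadata WHERE gender = ? ORDER BY clip_path",
        (gender,),
    )
    return [row[0] for row in rows]


# Example usage with a temporary directory standing in for clip storage.
demo_conn = make_db()
demo_dir = tempfile.mkdtemp()
demo_path = store_clip(demo_conn, demo_dir, b"fake audio bytes", "female")
print(find_clip_paths(demo_conn, "female") == [str(demo_path)])
```

The lookup only runs in one direction here: the database knows the path, but the file carries no key into the database. Someone with database access could still search by path, so this separation relies on access control as well as layout.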
from common-voice.
Just to solidify things a bit, there are two classes of metadata:
- PII - For example an email address
- Non-PII - For example gender
We do not want it to be the case that "You can find the voice clip using the metadata" when the metadata is PII metadata. This should be true internally and externally.
However, internally and externally we do want to be able to find voice clips that have a particular set of non-PII metadata, e.g. all voice clips that fit a particular demographic profile.
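One way to picture the two classes is a schema where PII lives in a table with no reference to any clip, while non-PII columns sit next to the clip path and can be queried freely. The sketch below is an assumption-laden illustration, not the actual design: SQLite, the table names `clip_metadata` and `contributor_pii`, the `age_group` column, and the helper names are all hypothetical.

```python
import sqlite3

# Non-PII metadata is stored with the clip path; PII is stored on its own,
# with no column that links it to a clip.
SCHEMA = """
CREATE TABLE clip_metadata (clip_path TEXT PRIMARY KEY, gender TEXT, age_group TEXT);
CREATE TABLE contributor_pii (email TEXT PRIMARY KEY);
"""

# Only these non-PII columns may be used to search for clips.
SEARCHABLE_COLUMNS = {"gender", "age_group"}


def open_db():
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    return conn


def record_contribution(conn, clip_path, email, gender, age_group):
    """Store non-PII with the clip and PII separately, without a join key."""
    conn.execute(
        "INSERT INTO clip_metadata VALUES (?, ?, ?)", (clip_path, gender, age_group)
    )
    conn.execute("INSERT OR IGNORE INTO contributor_pii VALUES (?)", (email,))
    conn.commit()


def clips_matching(conn, **profile):
    """Return clip paths whose non-PII metadata matches the given profile."""
    unknown = set(profile) - SEARCHABLE_COLUMNS
    if unknown:
        # PII fields such as email are rejected rather than searched.
        raise ValueError(f"not searchable: {sorted(unknown)}")
    where = " AND ".join(f"{column} = ?" for column in profile) or "1 = 1"
    rows = conn.execute(
        f"SELECT clip_path FROM clip_metadata WHERE {where} ORDER BY clip_path",
        tuple(profile.values()),
    )
    return [row[0] for row in rows]
```

With this layout, `clips_matching(conn, gender="female", age_group="twenties")` returns every clip that fits the profile, while `clips_matching(conn, email=...)` raises `ValueError`, and no query can join an email address to a clip because the tables share no key.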
We'll need to do this in M3, after we have designed the overall architecture.
First draft of basic architecture is here:
https://docs.google.com/drawings/d/12FasUel5NH6poKLcJaxlYr64U6N6YEsyNs-wZJx_y04/edit
The doc linked by George will be the living documentation for this. Closing.