Comments (9)
I think we should be verbose in the values and list the organization_level
as Organization
or Organizational Unit
.
Or another proposal would be:
"organization_level": "https://www.w3.org/ns/org#Organization"
"organization_level": "https://www.w3.org/ns/org#OrganizationalUnit"
from publicbodies.
Here's another idea: add organizational units in a different CSV file.
Organizational units can be very numerous, around several thousands for each country. They also tend to be updated in structure much more often. Putting them in a separate file will make downloading easier for people who are only interested in the main organizations. It would also be possible to have a different update schedule for them.
The main organization would remain where they are, at /data
. The complete file with main organization and units could all be put in a subfolder named /data/organizational_units
, so we would have a new folder with CSV files with the same names and the same schema as the main ones, but much larger.
from publicbodies.
Some countries' data (e.g. Switzerland) already seem to include organizational units.
If it's in scope I would also include this data from Brazil, as it is available.
from publicbodies.
@augusto-herrmann @hannesgassert I think it would be good to add these but we should agree the column name and meaning and add to datapackage.json first ...
from publicbodies.
How about this?
{
"id": "type",
"type": "string",
"description": "Type of entry: 'o' for organization level, 'ou' for organizational unit level"
},
from publicbodies.
@augusto-herrmann seems sensible though I dislike "type" as it is so overloaded. Perhaps "organizationType" or "organization-level" might be better.
/cc @hannesgassert
from publicbodies.
Agreed.
But we should use "organization_level" (with an underscore) in order to be consistent with the word separation scheme used in the rest of the column names.
from publicbodies.
If no one opposes this change to the data model, I should add this soon-ish.
Existing data should be updated with the new organization_level
column and their respective cells kept blank until they can be filled in from official sources.
from publicbodies.
That file would contain the full structure of government down to the smallest internal unit. This data tends to get very large very quickly and update very frequently.
We recently started publishing a daily csv of this for Brazil, and it's a 124 MB file. That is not so large, but to keep track of its changes in Git it may make the repository a lot slower and unwieldly.
I'm open to discussing other alternatives. Or whether or not it is really ok to store a file as large as this, frequently updated, in a Git repo.
Your thoughts, @todrobbins, @rufuspollock, @hannesgassert?
from publicbodies.
Related Issues (20)
- Nepal: add import scripts and schedule HOT 1
- Bot trying to update 2 data sources simultaneously creates conflict
- Fix infinite redirect and get site back online HOT 9
- Keep running update process even if step fails
- Convert website to Jekyll + Github Pages + Github Actions
- Implement retry in data import scripts
- Automatic parallel updates conflict with each other
- Rename `master` branch to `main` HOT 1
- Greece: values in `id` field are not sluggable
- Github Pages build takes too long and times out
- Data update scripts are still using the master branch HOT 1
- CSV download button still points to `master` branch
- Github Pages default Jekyll deploy does not render some pages properly
- Commit, push & rebase GH Action not working on `main` branch
- Site gives 404 error HOT 3
- Replace broken URL for dados.gov.br CKAN API HOT 2
- Add Switzerland to the list
- `import_br.py` works locally, but fails in Github Actions HOT 2
- Upgrade Frictionless
- Upgrade deprecated Github Action scripts
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from publicbodies.