Are in scope of the data for this project: a) only organizations (as

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

How about this? <div class="highlight highlight-source-json notranslate position-r

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

Decide whether or not organizational units are in scope about publicbodies HOT 9 OPEN

augusto-herrmann commented on May 24, 2024

Decide whether or not organizational units are in scope

from publicbodies.

Comments (9)

todrobbins commented on May 24, 2024 1

I think we should be verbose in the values and list the organization_level as Organization or Organizational Unit.

Or another proposal would be:

"organization_level": "https://www.w3.org/ns/org#Organization"
"organization_level": "https://www.w3.org/ns/org#OrganizationalUnit"

from publicbodies.

augusto-herrmann commented on May 24, 2024 1

Here's another idea: add organizational units in a different CSV file.

Organizational units can be very numerous, around several thousands for each country. They also tend to be updated in structure much more often. Putting them in a separate file will make downloading easier for people who are only interested in the main organizations. It would also be possible to have a different update schedule for them.

The main organization would remain where they are, at /data. The complete file with main organization and units could all be put in a subfolder named /data/organizational_units, so we would have a new folder with CSV files with the same names and the same schema as the main ones, but much larger.

from publicbodies.

augusto-herrmann commented on May 24, 2024

Some countries' data (e.g. Switzerland) already seem to include organizational units.

If it's in scope I would also include this data from Brazil, as it is available.

from publicbodies.

rufuspollock commented on May 24, 2024

@augusto-herrmann @hannesgassert I think it would be good to add these but we should agree the column name and meaning and add to datapackage.json first ...

from publicbodies.

augusto-herrmann commented on May 24, 2024

How about this?

          {
            "id": "type",
            "type": "string",
            "description": "Type of entry: 'o' for organization level, 'ou' for organizational unit level"
          },

from publicbodies.

rufuspollock commented on May 24, 2024

@augusto-herrmann seems sensible though I dislike "type" as it is so overloaded. Perhaps "organizationType" or "organization-level" might be better.

/cc @hannesgassert

from publicbodies.

augusto-herrmann commented on May 24, 2024

Agreed.

But we should use "organization_level" (with an underscore) in order to be consistent with the word separation scheme used in the rest of the column names.

from publicbodies.

augusto-herrmann commented on May 24, 2024

If no one opposes this change to the data model, I should add this soon-ish.

Existing data should be updated with the new organization_level column and their respective cells kept blank until they can be filled in from official sources.

from publicbodies.

augusto-herrmann commented on May 24, 2024

That file would contain the full structure of government down to the smallest internal unit. This data tends to get very large very quickly and update very frequently.

We recently started publishing a daily csv of this for Brazil, and it's a 124 MB file. That is not so large, but to keep track of its changes in Git it may make the repository a lot slower and unwieldly.

I'm open to discussing other alternatives. Or whether or not it is really ok to store a file as large as this, frequently updated, in a Git repo.

Your thoughts, @todrobbins, @rufuspollock, @hannesgassert?

from publicbodies.

Decide whether or not organizational units are in scope about publicbodies HOT 9 OPEN

Comments (9)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs