GithubHelp home page GithubHelp logo

Comments (9)

todrobbins avatar todrobbins commented on May 24, 2024 1

I think we should be verbose in the values and list the organization_level as Organization or Organizational Unit.

Or another proposal would be:

  • "organization_level": "https://www.w3.org/ns/org#Organization"
  • "organization_level": "https://www.w3.org/ns/org#OrganizationalUnit"

from publicbodies.

augusto-herrmann avatar augusto-herrmann commented on May 24, 2024 1

Here's another idea: add organizational units in a different CSV file.

Organizational units can be very numerous, around several thousands for each country. They also tend to be updated in structure much more often. Putting them in a separate file will make downloading easier for people who are only interested in the main organizations. It would also be possible to have a different update schedule for them.

The main organization would remain where they are, at /data. The complete file with main organization and units could all be put in a subfolder named /data/organizational_units, so we would have a new folder with CSV files with the same names and the same schema as the main ones, but much larger.

from publicbodies.

augusto-herrmann avatar augusto-herrmann commented on May 24, 2024

Some countries' data (e.g. Switzerland) already seem to include organizational units.

If it's in scope I would also include this data from Brazil, as it is available.

from publicbodies.

rufuspollock avatar rufuspollock commented on May 24, 2024

@augusto-herrmann @hannesgassert I think it would be good to add these but we should agree the column name and meaning and add to datapackage.json first ...

from publicbodies.

augusto-herrmann avatar augusto-herrmann commented on May 24, 2024

How about this?

          {
            "id": "type",
            "type": "string",
            "description": "Type of entry: 'o' for organization level, 'ou' for organizational unit level"
          },

from publicbodies.

rufuspollock avatar rufuspollock commented on May 24, 2024

@augusto-herrmann seems sensible though I dislike "type" as it is so overloaded. Perhaps "organizationType" or "organization-level" might be better.

/cc @hannesgassert

from publicbodies.

augusto-herrmann avatar augusto-herrmann commented on May 24, 2024

Agreed.

But we should use "organization_level" (with an underscore) in order to be consistent with the word separation scheme used in the rest of the column names.

from publicbodies.

augusto-herrmann avatar augusto-herrmann commented on May 24, 2024

If no one opposes this change to the data model, I should add this soon-ish.

Existing data should be updated with the new organization_level column and their respective cells kept blank until they can be filled in from official sources.

from publicbodies.

augusto-herrmann avatar augusto-herrmann commented on May 24, 2024

That file would contain the full structure of government down to the smallest internal unit. This data tends to get very large very quickly and update very frequently.

We recently started publishing a daily csv of this for Brazil, and it's a 124 MB file. That is not so large, but to keep track of its changes in Git it may make the repository a lot slower and unwieldly.

I'm open to discussing other alternatives. Or whether or not it is really ok to store a file as large as this, frequently updated, in a Git repo.

Your thoughts, @todrobbins, @rufuspollock, @hannesgassert?

from publicbodies.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.