GithubHelp home page GithubHelp logo

city-bureau / city-scrapers-cle Goto Github PK

View Code? Open in Web Editor NEW
14.0 9.0 14.0 1.94 MB

City Scrapers project for Cleveland

Home Page: https://cityscrapers.org/

License: MIT License

Python 99.87% Shell 0.13%
web-scraping python open-data scrapy city-scrapers

city-scrapers-cle's People

Contributors

alfenn avatar dependabot[bot] avatar haileyhoyat avatar lamle-ea avatar pjsier avatar rajivsinclair avatar simmonsritchie avatar skorasaurus avatar zabrahams avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

city-scrapers-cle's Issues

Cuyahoga County Council scraper is broken

Reported by Kate W. in the documenters Open House

on the documenters.org ; there are no meetings listed for Cuyahoga County Council.

The documenters url for county council is
https://cleveland.documenters.org/agencies/cuyahoga-county-council-209/
and upcoming County Council meetings should be listed there... but they are not. .

The scraper is specifically using an API from county council's website - http://council.cuyahogacounty.us/api/items/GetItemsByType?itemTypeCode=EVENT;NEWS;EVENTREG&languageCd=en-US&siteKey=141&returnEventsAfterDate=06/05/2020&returnEventsBeforeDate=12/02/2020
to obtain results and all of the meetings that returned happened this summer.

Looking at the county council website, there's no meetings listed ?! http://council.cuyahogacounty.us/en-US/events-calendar.aspx

update cle_city_scrapers

hello. @skorasaurus

the cle_city_scrapers spider currently failing due to an empty meeting entry (see screenshot). specifically, the start time is empty because the meeting is cancelled, and therefore the start time is returning as a NoneType object as opposed to a datatime object.

PR here: #56

Screen Shot 2022-10-19 at 1 16 46 PM

Cleveland Documenters Updates

I'm Lawrence Caswell, field coordinator for Cleveland Documenters. A few things:

THE MAIN THING: The Cle Docs team has decided to focus primarily on Cleveland City Council for this pilot year.
Capacity and impact were the main factors in the decision. Of note: all of Council and the mayor are up for reelection next year. This doesn't mean we won't be documenting other meetings. It does mean that we need to make sure the docs.org pages for Cleveland City Council and its committees have the information our documenters will need. We’re reviewing those pages now.

2 questions:

  • What can I do to help?
  • We have been added to a mailing list for info from Council. Would access to that email be useful for City Scrapers?

The other updates:

  • We plan to hold our first Cleveland Documenters orientation and “Covering Your Local Government” training in late October. We should begin assigning meetings the following week.

  • Beginning this week, we’re holding Cleveland Documenters virtual “office hours” every Friday from 2-3pm via Zoom. Drop in during the hour to ask questions, meet others in the network, and to talk about how we can build this community. City Scrapers included!. Here’s the FB event page: https://www.facebook.com/events/800955484057069/

cle_design_review fix

@skorasaurus

hey. Paul mentioned this scraper broke yesterday (08/22/2022). i think i fixed it, making a PR now.

bug: lines#83-85 .split() the text for each select item, and assumes the return of two items max. however, some options have additional verbiage on their item so returns more than two items.

fix: lines#83-88 (on PR#53) now extract the month and day separately instead of assuming the return of two items.

Screen Shot 2022-08-23 at 9 32 28 AM

Landmarks Commision

At https://cleveland.documenters.org/scrapers/ ; it's reported the landmarks committee doesn't have any upcoming meetings

testing

But at https://cleveland.documenters.org/agencies/cleveland-landmarks-commission-158/ upcoming meetings are listed.

and then if you go to the individual meeting's page within documenters, (e.g.
https://cleveland.documenters.org/meetings/landmarks-commission-32679/) it says the meeting is canceled. !??!?

This appears to be the old page that was scraped: http://city.cleveland.oh.us/CityofCleveland/Home/Government/CityAgencies/CityPlanningCommission/LandmarksCommission

This is the new page: https://planning.clevelandohio.gov/landmark/AGENDALIST.html

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.