iqtlabs / gitgeo
Discover the geography of open-source software.
License: Apache License 2.0
A feature to automatically build a new package and upload it to PyPI when you push a new git tag.
The advantage is that the developer has less complexity to think about, and the git tags automatically reflect what is published on PyPI.
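One piece of such a tag-triggered release is checking that the pushed tag matches the version being uploaded. The following is a minimal sketch, assuming tags are named like v1.2.3 (or 1.2.3) and that the CI system exposes the pushed ref as refs/tags/<tag>, as GitHub Actions does in GITHUB_REF. The function names are hypothetical and not part of GitGeo.

```python
import re

# Hypothetical helpers, not part of GitGeo.
# Assumption: release tags look like "v1.2.3" or "1.2.3".
_TAG_REF = re.compile(r"^refs/tags/v?(\d+\.\d+\.\d+)$")


def version_from_ref(ref):
    """Return the version encoded in a tag ref, or None if ref is not a release tag."""
    match = _TAG_REF.match(ref)
    return match.group(1) if match else None


def should_publish(ref, package_version):
    """Publish only when the pushed ref is a release tag that matches the package version."""
    tag_version = version_from_ref(ref)
    return tag_version is not None and tag_version == package_version
```

A release job could call should_publish with the pushed ref and the version declared by the package, and skip the upload when it returns False.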
There is an error with this repository's Renovate configuration that needs to be fixed. As a precaution, Renovate will stop PRs until it is resolved.
Error type: Cannot find preset's package (github>whitesource/merge-confidence:beta)
This issue lists Renovate updates and detected dependencies. Read the Dependency Dashboard docs to learn more.
These updates are currently rate-limited. Click on a checkbox below to force their creation now.
These are blocked by an existing closed PR and will not be recreated unless you click a checkbox below.
Dockerfile
.github/workflows/codeql-analysis.yml
actions/checkout v2
github/codeql-action v1
github/codeql-action v1
github/codeql-action v1
.github/workflows/pypi.yml
actions/checkout v2
.github/workflows/python-package.yml
actions/checkout v2
actions/setup-python v2
codecov/codecov-action v1
requirements.txt
beautifulsoup4 ==4.10.0
folium ==0.12.1
pandas ==1.3.4
pytest ==6.2.5
requests ==2.26.0
https://github.com/Novetta/CLAVIN
This repo from Novetta does geoparsing ("Paris, France" --> "France") and is likely superior to our quick-and-dirty geoparser. Although the code is, I think, written in Java, the REST version can also be run as a Docker container.
There is no need to pursue this currently. If there is strong interest in improving GitGeo's geoparsing, then this is simply one option to be evaluated.
@kdobolyi, for your situational awareness. Peter Bronez brought this to my attention because Charlie Greenbacker was a contributor to CLAVIN.
Describe the bug
When using the PyPI package of GitGeo, I get an error.
To Reproduce
pip install gitgeo
gitgeo --help
Expected behavior
This output:
usage: main.py [-h] [--package PACKAGE] [--repo REPO] [--multirepo]
               [--multirepo_map MULTIREPO_MAP] [--summary] [--output_csv]
               [--map] [--num {100,200,300,400,500}]

optional arguments:
  -h, --help            show this help message and exit
  --package PACKAGE     Specify Python (PyPI) package.
  --repo REPO           Specify GitHub repo.
  --multirepo           Scan multiple repos from input file.
  --multirepo_map MULTIREPO_MAP
                        Convert mutlirepo scan file into map.
  --summary             Display results by country.
  --output_csv          Output results in csv.
  --map                 Display country by country results in map.
  --num {100,200,300,400,500}
                        Specify max number of contributors per repo.
The Error I Get
Traceback (most recent call last):
  File "/Users/jmeyers/opt/anaconda3/bin/gitgeo", line 5, in <module>
    from gitgeo.main import main
  File "/Users/jmeyers/opt/anaconda3/lib/python3.7/site-packages/gitgeo/main.py", line 6, in <module>
    from gitgeo.mapping import make_map
  File "/Users/jmeyers/opt/anaconda3/lib/python3.7/site-packages/gitgeo/mapping.py", line 16, in <module>
    from gitgeo.geolocation import get_country_from_location
  File "/Users/jmeyers/opt/anaconda3/lib/python3.7/site-packages/gitgeo/geolocation.py", line 3, in <module>
    from gitgeo.geographies_list import (
  File "/Users/jmeyers/opt/anaconda3/lib/python3.7/site-packages/gitgeo/geographies_list.py", line 41, in <module>
    with open(Path(__file__).with_name("country_codes.csv"), errors="ignore", newline="") as file:
FileNotFoundError: [Errno 2] No such file or directory: '/Users/jmeyers/opt/anaconda3/lib/python3.7/site-packages/gitgeo/country_codes.csv'
Any help would be appreciated!
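The traceback shows geographies_list.py looking for country_codes.csv next to itself in site-packages, which suggests the file was not shipped with the installed package. With setuptools, non-Python files are generally installed only when they are declared as package data. The following diagnostic sketch lists which data files are absent from a package directory. The helper name is hypothetical, and the list of expected files is an assumption based on the file names that appear in the errors reported on this page.

```python
from pathlib import Path

# Assumption: file names taken from the errors reported in these issues.
EXPECTED_DATA_FILES = ("country_codes.csv", "world.json")


def missing_data_files(package_dir, expected=EXPECTED_DATA_FILES):
    """Return the expected data files that are absent from package_dir."""
    package_dir = Path(package_dir)
    return [name for name in expected if not (package_dir / name).is_file()]
```

For an installed package, package_dir would be the directory that contains the package's modules, for example the site-packages/gitgeo directory shown in the traceback.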
Describe the bug
A test case is failing.
To Reproduce
See the GitHub Actions log.
Expected behavior
Tests should pass.
@kdobolyi, can you please fix the tests or codebase so that the GitHub Actions tests pass?
Additional context
I think the new PR has tests that don't pass yet.
Need to investigate!
Describe the bug
The multirepo scan capability is currently broken.
To Reproduce
gitgeo --multirepo
Expected behavior
Scan the repos specified in repos.txt and return a CSV of contributors and their locations.
Screenshots
Traceback (most recent call last):
  File "/Users/jmeyers/opt/anaconda3/bin/gitgeo", line 8, in <module>
    sys.exit(main())
  File "/Users/jmeyers/opt/anaconda3/lib/python3.7/site-packages/gitgeo/main.py", line 123, in main
    scan_multiple_repos(num=args.num)
  File "/Users/jmeyers/opt/anaconda3/lib/python3.7/site-packages/gitgeo/multi_repo_scan.py", line 26, in scan_multiple_repos
    create_csv("multirepo", timestamp)
  File "/Users/jmeyers/opt/anaconda3/lib/python3.7/site-packages/gitgeo/custom_csv.py", line 23, in create_csv
    with open(filename, "w", encoding="utf-8", newline="") as file:
FileNotFoundError: [Errno 2] No such file or directory: 'results/multirepo_20211104-093528.csv'
Additional context
I suspect the PyPI-ification of this package--specifically placing the Python files in a gitgeo sub-directory--is preventing the GitGeo tools from accessing the files in the results directory. Can you please advise and, if possible, fix?
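The traceback shows open() failing on a relative path inside results/, which is what happens when that directory does not exist under the current working directory. One possible fix, sketched here under that assumption, is to create the parent directory before writing. The function name write_rows is hypothetical and is not GitGeo's create_csv.

```python
import csv
from pathlib import Path


def write_rows(filename, rows):
    """Write rows to a CSV file, creating missing parent directories first."""
    path = Path(filename)
    # open(..., "w") creates the file but not its directory, so create that first.
    path.parent.mkdir(parents=True, exist_ok=True)
    with open(path, "w", encoding="utf-8", newline="") as file:
        csv.writer(file).writerows(rows)
    return path
```

With this approach the results directory is created wherever the command is run, so an installed package does not depend on a results folder that only exists in the source checkout.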
Describe the bug
I tried to access the command-line arguments, specifically the help argument, and GitGeo returned nothing. Is this the intended behavior?
To Reproduce
python main.py --help
Expected behavior
I expect this output:
usage: main.py [-h] [--package PACKAGE] [--repo REPO] [--multirepo]
               [--multirepo_map MULTIREPO_MAP] [--summary] [--output_csv]
               [--map] [--num {100,200,300,400,500}]

optional arguments:
  -h, --help            show this help message and exit
  --package PACKAGE     Specify Python (PyPI) package.
  --repo REPO           Specify GitHub repo.
  --multirepo           Scan multiple repos from input file.
  --multirepo_map MULTIREPO_MAP
                        Convert mutlirepo scan file into map.
  --summary             Display results by country.
  --output_csv          Output results in csv.
  --map                 Display country by country results in map.
  --num {100,200,300,400,500}
                        Specify max number of contributors per repo.
Screenshots
I got no output.
Additional context
I replaced the main() function definition line in main.py with this line:
if __name__ == "__main__":
and the program performed as I expected. Is it okay if I replace this line? Or was this change made because of the recent PyPI-package-ification changes? Thank you, @jbenjoseph!
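An installed console script imports main from gitgeo.main (as the tracebacks elsewhere on this page show), so removing the function would likely break the installed command. A common pattern that supports both uses is to keep main() and call it from the guard. The sketch below is a reduced stand-in, not GitGeo's actual main.py: only two options from the help text are included, and the default value for --num is an assumption.

```python
import argparse


def build_parser():
    """Build a reduced parser; only two of the options from the help text are shown."""
    parser = argparse.ArgumentParser(prog="main.py")
    parser.add_argument("--repo", help="Specify GitHub repo.")
    parser.add_argument(
        "--num",
        type=int,
        choices=[100, 200, 300, 400, 500],
        default=100,  # assumed default, for illustration only
        help="Specify max number of contributors per repo.",
    )
    return parser


def main(argv=None):
    """Entry point usable by an installed console script and by `python main.py`."""
    args = build_parser().parse_args(argv)
    print(f"repo={args.repo} num={args.num}")  # placeholder for the real work
    return 0


if __name__ == "__main__":
    main()
```

With this layout, python main.py --help prints the usage text, and an entry point that imports main keeps working.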
The API limits a user to 5,000 requests per hour per token? But the GHTorrent website (http://ghtorrent.org/) doesn't seem to be accessible. What gives?
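Assuming the API in question is the GitHub REST API, a limit of 5,000 requests per hour averages out to one request every 0.72 seconds. Responses from that API carry X-RateLimit-Remaining and X-RateLimit-Reset headers, which a client can use to pause instead of failing. The helper below is a hypothetical sketch, not GitGeo code, and takes the headers as a plain dict.

```python
import time


def seconds_until_reset(headers, now=None):
    """Return how long to pause before the next request, in seconds.

    Returns 0 while requests remain in the current rate-limit window.
    Header names are matched exactly, so pass them in this capitalization
    (or use a case-insensitive mapping such as the one `requests` provides).
    """
    remaining = int(headers.get("X-RateLimit-Remaining", 1))
    if remaining > 0:
        return 0
    now = time.time() if now is None else now
    # X-RateLimit-Reset is the time the window resets, in UTC epoch seconds.
    reset_at = int(headers.get("X-RateLimit-Reset", now))
    return max(0, reset_at - int(now))
```

A scanning loop could call this after each response and sleep for the returned number of seconds before continuing.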
Specifically, assess each of these APIs: https://wiki.openstreetmap.org/wiki/Nominatim#Alternatives_.2F_Third-party_providers
Do any of them allow bulk requests (say, 10K per second) while remaining free?
Describe the bug
The tests fail locally.
To Reproduce
git clone https://www.github.com/iqtlabs/gitgeo
cd gitgeo
pytest
Expected behavior
The tests should pass locally.
Additional context
Three tests fail:
FAILED tests/test_gitgeo.py::TestMapping::test_add_contributor_count_to_json - FileNotFoundError: [Errno 2] No such file or directory: 'world.json'
FAILED tests/test_gitgeo.py::TestMapping::test_make_map_from_repo - FileNotFoundError: [Errno 2] No such file or directory: 'world.json'
FAILED tests/test_gitgeo.py::TestMapping::test_make_map_from_csv - FileNotFoundError: [Errno 2] No such file or directory: 'world.json'
Request
@jbenjoseph, can you please either help me fix this or provide a PR? The bug clearly has to do with the Path-related functionality, but I don't understand that functionality that well. Thank you!
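The failures name a bare 'world.json', which suggests (as an assumption, since the mapping code is not shown here) that the file is opened relative to the current working directory rather than relative to the module that needs it. The sketch below shows the Path-based alternative, the same idea as the Path(__file__).with_name(...) call visible in the geographies_list.py traceback. The helper name data_path is hypothetical.

```python
from pathlib import Path


def data_path(name, module_file):
    """Return the path of a data file stored in the same directory as module_file."""
    # resolve() makes the path absolute, so the result does not depend on
    # the directory the program or the tests are started from.
    return Path(module_file).resolve().with_name(name)
```

Inside a module, the call would look like data_path("world.json", __file__), which points at the module's own directory whether pytest is run from the repository root or from elsewhere.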