GithubHelp home page GithubHelp logo

university-domains-list's Introduction

University Domains and Names Data List & API

Do you need a list of universities and their domain names? You found it!

This package includes a JSON file that contains domains, names and countries of most of the universities of the world.

Example usecases:

  • You can create a validation script that checks the email domain.
  • You can automatically generate a user's country and university by looking at their emails.

You can use this data source in three ways:

  • Use the JSON file as your data source and do whatever you like with your favourite programming language.
  • Use free hosted-API.
  • Use the tiny Python app to serve a fast API that you can query data.

1 - Using the Data Source

The whole data source is located in the world_universities_and_domains.json file. It is just a list of dictionaries in the following format:

[
	...
	{
	    "alpha_two_code": "TR",
	    "country": "Turkey",
	    "state-province": null,
	    "domains": [
	        "sabanciuniv.edu",
	        "sabanciuniv.edu.tr"
	    ],
	    "name": "Sabanci University",
	    "web_pages": [
	        "http://www.sabanciuniv.edu/",
	        "http://www.sabanciuniv.edu.tr/"
	    ],
	},
	...
]

If you want a smaller final payload and only need a subset of countries, run

filter.py $country1 [Optional: $country2]

from the root directory to return

filtered_world_universities_and_domains.json

NOTE: Some universities use a format like [user]@[department].[domain], but this list only contains the [domain] portion. For example, an email address might be [student]@cs.usc.edu, and this list will contain 'usc.edu', the domain for the University of Southern California. Take this into consideration if using this list for email address validation.

2 - Using The Hosted API

This is the easiest method if you're making a small project or just want to discover the data without any hassle. It is sponsored by Hipo and free. If you have a big project, please host it on your own server.

Some example searches:

The hosted API uses university-domains-list-api package.

3 - Using the built-in API on your server

You can access the python API via university-domains-list-api

Contribution

Please contribute to this list! We need your support to keep this list up-to-date. Do not hesitate to fix any wrong data. It is extremely easy. Just open a PR, or create an issue.

Contributors

  • Yiğit Güler
  • Tuna Vargı
  • Patrick Michelberger
  • Rasim Demirbay
  • Ryan White
  • Bilal Arslan
  • anwilli5
  • Thomas Bauer
  • Emin Mastizada
  • Jai
  • Jimi Ford
  • Lars Schwegmann
  • Sedat Karancı
  • Charles Bedrosian
  • Harrison Lo
  • mattdfloyd
  • Ender Ahmet Yurt
  • Enis Behiç Tuysuz
  • Syed Zakawat
  • Daksh Shah
  • Maizer Gomes
  • Denys Vitali
  • Ary Wibowo
  • Matt Floyd
  • Joris Boquet
  • Konstantin Ladutenko
  • Romain Odeval
  • remediate
  • Errorific
  • summerplaybook
  • hamedty
  • Sedat
  • Sotirios Roussis
  • majilesh
  • Itay Grudev
  • luungoc2005
  • Ajithkumar Sekar
  • Christopher Chen
  • Dimitris Karakostas
  • Chun Fei Lung
  • Mamat Rahmat
  • Wisnu Adi Nurcahyo
  • jvanstraten
  • Ekin Dursun
  • Kevin Bohinski
  • Lachlan Marnham
  • Baptiste Pellarin
  • Kelian Baert
  • more

Created and maintained by Hipo

university-domains-list's People

Contributors

4ni1ak avatar aatomical avatar aceofspades5757 avatar alanhamlett avatar antmaxi avatar aviiciii avatar bcwood avatar co-bby avatar dakshshah96 avatar dl-eric avatar elliotwutingfeng avatar enisbt avatar huseyinsimsekk avatar iamstiil avatar isaiascuevas avatar jarek-t avatar markperri avatar mudongliang avatar n1kio avatar onlined avatar silvagustin avatar smith120bh avatar sphaxa avatar sudeshana avatar thatomokoena avatar tugberkozkara avatar tunavargi avatar yigitguler avatar yqopensource avatar zsubzwary avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

university-domains-list's Issues

domain fix

Hello,

Name: "Bilecik Seyh Edebali University"
Please update domain: bilecik..edu.tr to bilecik.edu.tr

Thanks!

Dataset origin

This is a fantastic project! Thank you for sharing.

Where did the dataset of universities and domains come from?

Add University

we just need to add a New university CIC - Canadian International College Campus of cape bbreton university in canada, with the following data

{
    "web_pages": [
        "http://cic-cairo.com/"
    ],
    "alpha_two_code": "EG",
    "state-province": null,
    "country": "Egypt",
    "domains": [
        "cic-cairo.com"
    ],
    "name": "CIC - Canadian International College"
}

Universidad Loyola Andalucía

Please add this University created in 2011 in Spain.
Thank you very much.

{
"web_pages": [
"https://www.uloyola.es/"
],
"name": "Universidad Loyola Andalucía",
"alpha_two_code": "ES",
"state-province": Andalusia,
"domains": [
"uloyola.es"
],
"country": "Spain"
}

Switching from Unicode escape to UTF-8

I noticed that only ASCII characters used in JSON file, the other characters are identified by Unicode escapes. But we can use UTF-8 characters in a JSON file, as specified in RFC 7159, and RFC 8259. I suggest we should convert Unicode escapes to corresponding UTF-8 characters (codepoints).

Schools with Multiple Domains

There are some schools with multiple domains. e.g.:

University of Illinois Urbana-Champaign

  • illinois.edu
  • uiuc.edu

Although the last one will be redirected to the the first one, but for sake of email verification you might need both in the DB. Their faculties have emails from both domains.

My suggestion is to convert "domain" field from a string to list of strings and for cases like this we can add multiple entries.

If everyone are OK with this scheme I can create a PR.

Improve tests

We should improve the tests and check if all values are trimmed.

Example: #118

Correct an university name

hi, when I was having a look at your API, I encountered a typo. Could you please correct the name for that university is given below. Thanks for the API.

web_page: "http://www.mehmetakif.edu.tr/",
country: "Turkey",
domain: "mehmetakif.edu.tr",
name: "Mehemet Akif Ersoy University", --> this one.
alpha_two_code: "TR"

Merge with WikiData or use WikiData to extend/verify the list.

To increase completeness and reduce maintenance burden of URL and University metadata consider integrating with WikiData's University list. A WikiData SPARQL query can return the same information as provided in this university list in JSON if format=json.

WikiData is CC0 which is compatible with this list and you'll have the Wikipedia community invovled in maintainting the data too: https://www.wikidata.org/wiki/Wikidata:WikiProject_Universities

Keep this list and compare queries periodically or rely on Wikidata and focus on creating a view to monitor the data for errors and changes.

Adding university

Can you add this please ? Thank you.

{
"web_pages": [
"https://www.mmibordeaux.com/"
],
"alpha_two_code": null,
"state-province": null,
"country": "France",
"domains": [
"mmibordeaux.com"
],
"name": "DUT MMI - Université Bordeaux Montaigne"
}

Add Liverpool John Moores University

{
    "alpha_two_code": "UK",
    "country": "United Kingdom",
    "domain": "ljmu.ac.uk",
    "name": "Liverpool John Moores University",
    "web_page": "http://www.ljmu.ac.uk/"
},

Add countryCode : ISO3166-1-Alpha-2 field

When mapping this JSON across various other data sets, it is harder to search because of the mismatch of names. For example, the universal name of Russia is 'Russian Federation' but here it is referred to as 'Russia.'

It would be better to add another field countryCode which says the universal alpha2 country code of the specific country.

Add La Salle Universitat Ramon Llul

Add "La Salle - University Ramon Llul" (Barcelona, Spain)

{
      "alpha_two_code": "ES",
      "country": "Spain",
      "domain": "salleurl.edu",
      "name": "La Salle - Universitat R\u00e1mon Llull",
      "web_page": "http://www.salleurl.edu"
}

Thanks

Update Indonesian university

Hi,
I want to update Indonesian university data to the current state. There is only some popular university listed here and my quick scan saw that some university have changed their website/domain. Based on our Ministry of Higher Education statistics, currently we have (about) 4507 university listed.
I am planning to create a script to scrap these data and add that to here. What do you think?
However in our beloved developed country, some university haven't use .ac.id/.edu domain for their website yet and use gmail even for their official email. Is it counted as passing criteria here?

adding university

Please add domain for Duy Tan University from Viet Nam, thanks for help
{
"web_pages": [
"http://www.dtu.edu.vn/"
],
"name": "Duy Tan University",
"alpha_two_code": "VN",
"state-province": null,
"domains": [
"dtu.edu.vn"
],
"country": "Viet Nam"
},

Duplicate Names with Same Domains & Web pages

Hi there,
I see that there are issues with same website and domain is associated with multiple entries such as the following is this expected behaviour? there are more than one of such occurances

{
"web_pages": [
"http://www.sjsu.edu/"
],
"name": "California State University, San Jose",
"alpha_two_code": "US",
"state-province": null,
"domains": [
"sjsu.edu"
],
"country": "United States"
},

and

{
"web_pages": [
"http://www.sjsu.edu/"
],
"name": "San Jose State University",
"alpha_two_code": "US",
"state-province": null,
"domains": [
"sjsu.edu"
],
"country": "United States"
},

Data Change

Hi Yiğit,
Current Hacettepe University (hun.edu.tr) domain is old one. Currently, Hacettepe University using "hacettepe.edu.tr". Can you please add/fix it ?

Conflict in Name and Data of Repository !!

This is called “university-domains-list” but it also contains the information of soo many colleges and other Educational Institutes (which can’t be categorized as University) in the JSON file, so I think the Documentation should be updated accordingly !
OR
The data should be restricted to University only.
What do you think @yigitguler ..?

Wrong character in name and country

Please update country and name for following:

{
"alpha_two_code": "CI",
"country": "C\u00f4te d'Ivoire",
"domain": "uabobo.ci",
"name": "Universit\u00e9 d'Abobo-Adjam\u00e9",
"web_page": "http://www.uabobo.ci/"
},
{
"alpha_two_code": "CI",
"country": "C\u00f4te d'Ivoire",
"domain": "ubouake.ci",
"name": "Universit\u00e9 de Bouak\u00e9",
"web_page": "http://www.ubouake.ci/"
},
{
"alpha_two_code": "CI",
"country": "C\u00f4te d'Ivoire",
"domain": "univ-cocody.ci",
"name": "Universit\u00e9 de Cocody",
"web_page": "http://www.univ-cocody.ci/"
},

University of Linz is out of date

Hi,

 {
    "web_pages": [
      "http://www.uni-linz.ac.at/"
    ],
    "name": "Universität Linz",
    "alpha_two_code": "AT",
    "state-province": null,
    "domains": [
      "uni-linz.ac.at"
    ],
    "country": "Austria"
  },

Should actually be:

 {
    "web_pages": [
      "http://www.jku.at/"
    ],
    "name": "Johannes Kepler Universität Linz",
    "alpha_two_code": "AT",
    "state-province": null,
    "domains": [
      "jku.at"
    ],
    "country": "Austria"
  },

See also: https://en.wikipedia.org/wiki/Johannes_Kepler_University_Linz

Cheers,
Lukas

University Logos

Hi Guys,

You have done a great job! Well done, really useful source to be honest.

I just want to ask that is there any way to get universities logos'?

That would be more useful if it can be implemented.

Let me know if you are planning something like that I would love to help.

Thanks

invalid json

When attempting to update my local list this morning, it seemed that my PHP script was failing to grok the JSON file at https://github.com/Hipo/university-domains-list/blob/master/world_universities_and_domains.json

I copied the contents to JSONlint.com, and it gives this:

Error: Parse error on line 37474:
...fi/"		],		"name": Laurea University of
---------------------^
Expecting 'STRING', 'NUMBER', 'NULL', 'TRUE', 'FALSE', '{', '[', got 'undefined'

Manual inspection of the file shows that Laurea University of Applied Sciences was missing a preceding quote. After adding the quote, my local script worked just fine, and JSONlint.com verifies the file as valid JSON.

Please add Georgian Free University

As I see, the record for this university should be like this:
{ "alpha_two_code": "GE", "country": "Georgia", "domain": "freeuni.edu.ge", "name": "Free University of Tbilisi", "web_page": "http://freeuni.edu.ge/" }

Thanks for your effort to keep this list fresh and actual!

University of Aberdeen (UK) - repeated twice

Seems University of Aberdeen appears twice inside the JSON

  {
    "web_pages": [
      "http://www.abdn.ac.uk/"
    ],
    "alpha_two_code": "GB",
    "state-province": null,
    "country": "United Kingdom",
    "domains": [
      "abdn.ac.uk"
    ],
    "name": "University of Aberdeen"
  },
  {
    "web_pages": [
      "http://www.abdn.ac.uk/"
    ],
    "name": "University of Aberdeen",
    "alpha_two_code": "GB",
    "state-province": null,
    "domains": [
      "aberdeen.ac.uk"
    ],
    "country": "United Kingdom"
  },

Fix name and domain of TU Hamburg

"Technische Universität Hamburg-Harburg" was renamed to "Technische Universität Hamburg". The domain has also changed: www.tuhh.de

{
"web_pages": [
"http://www.tuhh.de/"
],
"name": "Technische Universität Hamburg",
"alpha_two_code": "DE",
"state-province": null,
"domains": [
"tuhh.de"
],
"country": "Germany"
},

Add city and state

Would it be possible to add the city and state to the json file? Several universities in the US can have the same name and adding city/state to the results would help distinguish them from one another

Change / Add Hochschule-Bonn-Rhein-Sieg

The university
{
"alpha_two_code": "DE",
"country": "Germany",
"domain": "fh-bonn-rhein-sieg.de",
"name": "Fachhochschule Bonn-Rhein-Sieg",
"web_page": "http://www.fh-bonn-rhein-sieg.de/"
},

is now called

Hochschule-Bonn-Rhein-Sieg

Please add:

{
"alpha_two_code": "DE",
"country": "Germany",
"domain": "hochschule-bonn-rhein-sieg.de",
"name": "Hochschule Bonn-Rhein-Sieg",
"web_page": "http://www.hochschule-bonn-rhein-sieg.de/"
},

There is also a short Domain: h-brs.de

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.