
notracking / hosts-blocklists


Automatically updated, moderated and optimized lists for blocking ads, trackers, malware and other garbage

blocklist hostfile adblock dns dnsmasq ads blocklists hostname domain dnscrypt-proxy

hosts-blocklists's Introduction

The NoTracking blocklist will be shutting down soon. Please read #900 for more details.

No more ads, tracking and other virtual garbage

The NoTracking blocklist is a DNS-based filter list for blocking ads, malware, phishing and other online garbage.

General blocklist policies

  • Should not break useful and commonly used services
  • Blocks tracking servers
  • Blocks advertising servers
  • Blocks analytics servers
  • Blocks scam websites
  • Blocks malware servers
  • Blocks webminers
  • Blocks phishing servers

Optimization

The optimizer makes full use of domain-name-based wildcard filtering (for example *.adhost.net). This reduces the chance of missing new subdomains and significantly reduces the size of the blocklists.
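The idea behind wildcard optimization can be illustrated with a minimal Python sketch. It is not the project's actual optimizer: the function name `collapse_to_wildcards` and the sample hostnames are assumptions made for illustration. It simply drops every hostname whose parent domain is already blocked, since a wildcard entry for the parent covers it.

```python
def collapse_to_wildcards(hostnames):
    """Drop every hostname already covered by a blocked parent domain."""
    blocked = set(h.strip().lower().rstrip(".") for h in hostnames if h.strip())
    kept = []
    for host in sorted(blocked):
        labels = host.split(".")
        # The parent domains of a.b.c are b.c and c.
        parents = (".".join(labels[i:]) for i in range(1, len(labels)))
        if not any(p in blocked for p in parents):
            kept.append(host)
    return kept


# Hypothetical sample input.
hosts = ["adhost.net", "cdn.adhost.net", "img.cdn.adhost.net", "tracker.example"]
print(collapse_to_wildcards(hosts))  # ['adhost.net', 'tracker.example']
```

Only entries whose parent is itself on the list are removed, so the sketch never widens what is blocked.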

Dead hosts removal

All hostnames are continuously monitored. If the A, AAAA, CNAME and NS lookups for a hostname all return NXDOMAIN, the hostname is marked as dead and removed. Domains are also checked against their whois data and removed once they have been unregistered for a certain time.

The current list of dead hostnames and the list of all unregistered domains are both published by the project.

Sources

Most sources are public hosts-file style lists. Several Adblock Plus lists are also included, but only for their non-third-party network filters of the form ||evilhost.com^. See SOURCES.md for the full overview of all included lists.
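As a rough illustration of how plain domain filters could be pulled out of an Adblock Plus style list, here is a minimal Python sketch. The regular expression, the function name `extract_domains` and the sample lines are assumptions; the project's real import rules are not documented here and may differ.

```python
import re

# Matches plain domain-anchored network filters such as ||evilhost.com^
# and rejects anything with a path, options ($third-party) or an
# exception prefix (@@). The TLD pattern is deliberately simplified.
DOMAIN_FILTER = re.compile(r"^\|\|([a-z0-9][a-z0-9.-]*\.[a-z]{2,})\^$")


def extract_domains(lines):
    """Return the domains of all plain ||domain^ filters in the input."""
    domains = []
    for line in lines:
        match = DOMAIN_FILTER.match(line.strip().lower())
        if match:
            domains.append(match.group(1))
    return domains


# Hypothetical sample of filter-list lines.
sample = [
    "! comment",
    "||evilhost.com^",
    "||evilhost.com^$third-party",
    "||example.org/banner.js",
    "##.ad-box",
    "@@||goodhost.com^",
]
print(extract_domains(sample))  # ['evilhost.com']
```

Cosmetic rules, path-based rules and filters carrying options are skipped because they cannot be expressed as a DNS-level block.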

Versions

List                                            Compatibility
dnsmasq/dnsmasq.blacklist.txt                   Dnsmasq
adblock/adblock.txt                             Adguard Home, uBlock Origin, Pi-Hole
dnscrypt-proxy/dnscrypt-proxy.blacklist.txt     Dnscrypt-proxy
unbound/unbound.blacklist.conf                  Unbound
hostnames.txt & domains.txt                     Dnsmasq (version < 2.80 only, use both files)

How to install

Supporting Notracking blocklist

If you are interested in supporting the project you can:

  • Submit false positives
  • Donate with Paypal
  • Donate Bitcoin: 37VkbtMDgxCiHaCSS4PNPDo2z5AwM8grB2
  • Donate Ethereum: 0xaf446938cd43de5a267e9596a40be55a86f6b4a8

hosts-blocklists's People

Contributors

jasrys, k03mad, notracking, ssrebelious


hosts-blocklists's Issues

Question: Do I need to restart dnsmasq after updating files?

As the title says: if I overwrite the files without restarting the dnsmasq service, will they still be read? Or is it impossible to retain the cache and still pick up updates to the rules (which would obviously only take effect once the TTL of the cached DNS entries expires)?

Just unsure how this thing works, never used dnsmasq quite this way.

Aliexpress.com broken lately

Hey!
https://www.aliexpress.com/ doesn't work anymore (it started sometime this week) because alicdn.com is blocked in domains.txt. This is a CDN for the Alibaba shopping site, which is more popular than eBay, so I don't think it should be blocked. It is responsible for delivering the content (images, scripts, layouts, etc.) to users; with it blocked, the website is mostly plain HTML.

Single tar.gz file

Would it be possible to provide a single tar.gz file containing domains.txt and hostnames.txt?

I may have used your lists.

I followed the advice of W3K; it makes the list easier to download for the Pi-hole.

W3K said.
We’d suggest checking out GH pages. It’s free and PH won’t redownload the blocklist if there’s no new changes - W3K
https://twitter.com/The_Pi_Hole/status/920758922121641984

If you would like to have a special note, could you create an issue? I have also done some work on your list to make it more acceptable for the Pi-hole.
https://github.com/smokingwheels/smokingwheels.github.io
At the moment I need a bit of time to relax and have a rest.

Not blocking everything?

Hey,

I've noticed that after applying these to my DNS server, it's not blocking a lot of YouTube ads, and it also breaks login through Facebook.

Great work though and thank you so much for maintaining it!

Online betting site blocked

Hi, the online betting site Bet365 is currently getting blocked due to being listed.

address=/bet365.com/0.0.0.0 

Many thanks for the blocklist; I have not seen many false positives over the last few days of using it.

No boundaries for user identities: Web trackers exploit browser login managers

In this second installment of the No Boundaries series, we show how a long-known vulnerability in browsers’ built-in password managers is abused by third-party scripts for tracking on more than a thousand sites...

Source : https://freedom-to-tinker.com/2017/12/27/no-boundaries-for-user-identities-web-trackers-exploit-browser-login-managers/

You can test the attack yourself on our live demo page : https://senglehardt.com/demo/no_boundaries/loginmanager/

List of sites embedding scripts that abuse login manager for tracking : https://webtransparency.cs.princeton.edu/no_boundaries/autofill_sites.html

Youtube is blocked

☺ ping www.youtube.com
PING youtube-ui.l.google.com (216.58.214.238): 56 data bytes
64 bytes from 216.58.214.238: icmp_seq=0 ttl=54 time=25.955 ms
64 bytes from 216.58.214.238: icmp_seq=1 ttl=54 time=24.695 ms

And youtube-ui.l.google.com is added to the list of blocked hosts.

Overblocking on web.de because i0.web.de is blocked

I am using your hostnames.txt with the Pi-hole software and noticed today that the news teaser images on web.de are not shown on my mobile, because the domain i0.web.de is blocked.

When I open web.de on the PC, all images load, because they are served from another domain.

So could you please remove i0.web.de from the list?
Thank you very much.

Merry Christmas 🎄🎅

Possible broken entries in hosts file

The ones below look weird to me, and they do not resolve anyway:

0.0.0.0 ms0.0.0.001341.club
0.0.0.0 q28840.nb.host0.0.0.0.com
0.0.0.0 w4988.nb.host0.0.0.0.com
0.0.0.0 w612.nb.host0.0.0.0.com
0.0.0.0 z32538.nb.host0.0.0.0.com

Adult context

Does this include blocking adult sites? If no, do you know a list that I can concatenate that does?

Special Character Causing Load to Fail on CentOS 6.9

Greetings,

When attempting to start dnsmasq after following the instructions, the following error is returned:

Starting dnsmasq:
dnsmasq: error at line 59565 of /etc/dnsmasq.d/domains.txt [FAILED]

This corresponds to the entry address=/streamillimité.com/0.0.0.0. I believe the special char is causing it to fail. If I comment the entry out it works fine.
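One possible workaround, sketched here under assumptions rather than taken from the project, is to convert internationalized domain names to their ASCII (Punycode) form before writing them to the list, so the file contains no non-ASCII characters. The helper names `to_ascii_domain` and `asciify_address_line` are hypothetical, and the sketch uses Python's built-in `idna` codec; whether a given dnsmasq build then accepts the line has not been verified here.

```python
def to_ascii_domain(domain):
    """Encode each label with IDNA so the result is plain ASCII."""
    return domain.encode("idna").decode("ascii")


def asciify_address_line(line):
    """Rewrite an address=/<domain>/<ip> line with an ASCII-only domain."""
    prefix, domain, ip = line.strip().split("/")
    return "/".join([prefix, to_ascii_domain(domain), ip])


line = "address=/streamillimité.com/0.0.0.0"
converted = asciify_address_line(line)
print(converted)           # the domain now starts with the xn-- prefix
print(converted.isascii())
```

Domains that are already ASCII pass through unchanged, so the conversion could be applied to every line of the file.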

Great project

False positive: schibsted.io

Although the company seems to be involved with advertising in some way, it provides a number of other services as well. The whole domain is blocked, but I'm quite sure that at least the

widget.messaging.schibsted.io

domain provides legit functionality.

Running pihole -g resets list

Using PH 2.6.3, and following the directions.

After doing pihole -g the adlist.default file is put in its original state like it's being rewritten. I was expecting my previous saved changes to stick.

  • Uncomment the following two lines in /etc/pihole/adlist.default

https://raw.github.com/notracking/hosts-blocklists/master/hostnames.txt
https://raw.github.com/notracking/hosts-blocklists/master/domains.txt

  • Comment out any other list that is already included
  • Update blocklists with `pihole -g`

Domain 'inside-graph' wrong?

In the domains list there is an entry:

address=/inside-graph/0.0.0.0

That seems wrong to me. The TLD does not exist and it doesn't resolve unless it would be an intranet/inside/private domain.
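A check for entries like this could be automated. The following Python sketch is only an illustration: the function name `single_label_entries` and the sample lines are assumptions. It flags every `address=/<domain>/<ip>` line whose domain contains no dot, so that such entries can be reviewed by hand.

```python
def single_label_entries(lines):
    """Return domains from address=/<domain>/<ip> lines that contain no dot."""
    suspicious = []
    for line in lines:
        line = line.strip()
        if not line.startswith("address=/"):
            continue
        parts = line.split("/")
        if len(parts) < 3:
            continue
        domain = parts[1].strip(".")
        if domain and "." not in domain:
            suspicious.append(domain)
    return suspicious


# Hypothetical excerpt of a dnsmasq domains file.
sample = [
    "address=/inside-graph/0.0.0.0",
    "address=/adhost.net/0.0.0.0",
    "# comment",
]
print(single_label_entries(sample))  # ['inside-graph']
```

A name without a dot is not necessarily wrong (it could be a real top-level domain), which is why the sketch only reports candidates instead of deleting them.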

Activate the Wiki

Hello,
I want to share some VBScript code to:

  • automatically update the hosts file (Windows)
  • format the wildcard DNS entries for the Acrylic DNS Proxy AcrylicHosts.txt file
  • format the wildcard DNS entries for Unbound's unbound.conf (DNS server)

Is it possible to activate the Wiki?

Thanks

Entry in hosts file needs to be split?

Looks like this line:

0.0.0.0 zzwzjidz.bid0.0.0.0 1beb2a44.space

... needs to be split into two lines?

As in:

0.0.0.0 zzwzjidz.bid
0.0.0.0 1beb2a44.space
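A repair step for fused lines of this kind could look like the following Python sketch. The regular expression and the function name `split_fused` are assumptions for illustration, not part of the project's tooling. A hostname is cut wherever another `0.0.0.0` followed by whitespace begins.

```python
import re

# A hosts entry: sink address, whitespace, hostname. The lookahead ends the
# hostname where another "0.0.0.0" followed by whitespace begins, or at the
# next whitespace or the end of the line.
ENTRY = re.compile(r"0\.0\.0\.0\s+(\S+?)(?=0\.0\.0\.0\s|\s|$)")


def split_fused(line):
    """Split a line holding several fused hosts entries into separate entries."""
    return ["0.0.0.0 " + host for host in ENTRY.findall(line)]


print(split_fused("0.0.0.0 zzwzjidz.bid0.0.0.0 1beb2a44.space"))
# ['0.0.0.0 zzwzjidz.bid', '0.0.0.0 1beb2a44.space']
```

A well-formed line comes back as a single entry, so the function could be run over the whole file.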

cedexis.net should be removed from domains.conf

Breaks mozilla things

dig download.cdn.mozilla.net

; <<>> DiG 9.10.3-P4-Debian <<>> download.cdn.mozilla.net
;; global options: +cmd
;; Got answer:
;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 45094
;; flags: qr rd ra; QUERY: 1, ANSWER: 5, AUTHORITY: 0, ADDITIONAL: 1

;; ANSWER SECTION:
download.cdn.mozilla.net. 19 IN CNAME 2-01-2967-001e.cdx.cedexis.net.
2-01-2967-001e.cdx.cedexis.net. 20 IN CNAME wildcard.cdn.mozilla.net.edgesuite.net.
wildcard.cdn.mozilla.net.edgesuite.net. 4942 IN CNAME a1284.dscg.akamai.net.
a1284.dscg.akamai.net. 20 IN A 194.177.211.136
a1284.dscg.akamai.net. 20 IN A 194.177.211.139

whitelist domains and suggestions for other lists

Hello!

Thank you for the excellent combined (wildcard) list. I've found some domains that, I believe, do not belong in this list.

bing.net, virtualearth.net - Bing search and Bing maps
here.com - Nokia Here maps used by Facebook.
gearbest.com - popular electronic store.
jwpcdn.com, jwplatform.com, jwpltx.com - The JWPlayer video platform.
h-cdn.com - HolaCDN - CDN for video delivery. Used for video streaming.
symcd.com - Symantec CA. Used for verifying SSL certificates.
performgroup.com - Digital sports content and media group. Domain used for content delivery to other Perform brands. Example, goal.com
po.st - Link shortener service used mostly on social networks.

Some suggestions for your (wildcard) list:
DShield.org Suspicious Domain List
Mailscanner list of phishing sites
Mitchell Krogs Badd-Boyz-Hosts
WaLLy3K database of lists from various sources.

Host 'xiaoqiang' wrong?

There is an entry in the hosts file:

0.0.0.0 xiaoqiang

That seems to be wrong, it doesn't seem to resolve, and as a TLD it doesn't exist either.

subdomains of domains, in domains.txt

Hi

Examples:

domains.txt contains the likes of:

collect.igodigital.com and
igodigital.com

c.mystat-in.net and
mystat-in.net

hit.stat24.com and
stat24.com

As I understand it, you should be just listing the domains, and the subdomains will be automatically blocked. So I'm wondering why you're listing subdomains of domains that are listed in the domains.txt file?

More to whitelist

Hi,
samsungrm.net domain is needed for remote management of TV-sets by Samsung support.

File retrieval URL incorrect?

I'm not sure if this is a pi-hole specific issue, however I needed to change the URLs in pi-hole's default list file from:

https://raw.github.com/notracking/hosts-blocklists/master/...

to

https://raw.githubusercontent.com/notracking/hosts-blocklists/master/...

to get pi-hole to actually pull the lists. The URL as is just causes pi-hole to report no changes and not download anything.

Whitelist Websites that should show Ads

Hi,

I have a question. Is it possible to whitelist domains so that the blacklist is ignored for them? In my case, some websites cannot be opened when they detect that an ad blocker is installed (e.g. www.spiegel.de). Is there any way to whitelist www.spiegel.de so that ads are shown on this site?

Thanks
Regards
Manuel

Why two files

Excuse my question, but why are two files needed? Can't this be achieved with just domains.txt? Also, is there a way to allow ads but block tracking?

The reason: many sites now require ads, but it is still desirable to block tracking.

jdownloader.org blocked

hi,
big thanks to the list=]
My question is why jdownloader.org is blocked; maybe it can be removed.
I have a VM which uses the JDownloader API at api.jdownloader.org, and I ran into this problem with the list.

thanks=]

dnsmasq doesn't like 127.0.0.1

The blocklist seems to contain the localhost. This causes an error:

dnsmasq:: ignoring nameserver 127.0.0.1 - local interface

Google hangouts on desktop is blocked

Hi, this list was working great for me but today I noticed sending messages signed in to my Gmail account via hangouts were not sending (but they sent from my phone, must use different servers). I found out it was related to this hosts file. I am not exactly sure how to find the exact domain that it is blocking, if you help me find out how to do that I will report the domain. Thanks.

dansguardian.org

Hello.

grep -r dansguardian scripts/rag-adblocker/tld/
scripts/rag-adblocker/tld/notracking-domains.tld:14792:address=/dansguardian.org/192.168.25.254

Should dansguardian.org really be on the blacklist?

DansGuardian [...] is content-control software designed to control which websites users can access. It also includes virus filtering and usage monitoring features.

More to whitelist.

Hi, blocking graph.facebook.* and graph.instagram.* breaks these mobile clients.
Also, blocking the whole kiev.ua zone is definitely an overkill.

domains for the whitelist?

Hello!
I'm not sure why these domains are blocked. Maybe there was some problem before, but not now.
tenforums.com - Forum
eightforums.com - Forum
jwplatform.com - The JWPlayer video platform
ooyala.com - The Ooyala Player. Example, used on accuweather.com to play videos.

The next one is problematic.
googletagmanager.com

Example, go to https://www.asus.com/Laptops/N550JX/HelpDesk_Download/.
Choose operating system. The drivers list won't show up. Removing the googletagmanager from the blacklist solves the problem.

Suggestions for your list:

  1. Airelle Trackers - mirror, official
  2. Shalla Spyware/ Malicious - mirror, official
  3. AdguardDNS list - mirror, official
  4. Fabrice Prigent's ads, malware and phishing lists - mirror-ads, mirror-malware, mirror-phishing, official-ads, official-malware, official-phishing
  5. BarbBlocks list - official
  6. Joe Wein spam list - official
  7. bkrcrc turk adlist - official

Support for IPv6

Please consider supporting IPv6.

The way to do that is to duplicate every line in the hosts.txt file and, on the duplicated line, replace 0.0.0.0 with ::1

For example:

0.0.0.0 0000mps.webpreview.dsl.net
::1 0000mps.webpreview.dsl.net
0.0.0.0 0001.2waky.com
::1 0001.2waky.com
0.0.0.0 000.gaysexe.free.fr
::1 000.gaysexe.free.fr
0.0.0.0 000owamail0.000webhostapp.com
::1 000owamail0.000webhostapp.com

etcetera
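The duplication described above could be scripted roughly as follows. This is a minimal Python sketch: the function name `add_ipv6_entries` and its parameters are assumptions. The IPv6 sink is left configurable because `::1` is the loopback address, while `::` (the unspecified address) would be the closer IPv6 counterpart of 0.0.0.0.

```python
def add_ipv6_entries(lines, v4_sink="0.0.0.0", v6_sink="::1"):
    """Follow every '<v4_sink> <host>' line with a '<v6_sink> <host>' line."""
    out = []
    for line in lines:
        out.append(line)
        parts = line.split()
        # Only plain two-field entries are duplicated; comments pass through.
        if len(parts) == 2 and parts[0] == v4_sink:
            out.append(f"{v6_sink} {parts[1]}")
    return out


hosts = [
    "# a comment",
    "0.0.0.0 0000mps.webpreview.dsl.net",
    "0.0.0.0 0001.2waky.com",
]
for entry in add_ipv6_entries(hosts):
    print(entry)
# # a comment
# 0.0.0.0 0000mps.webpreview.dsl.net
# ::1 0000mps.webpreview.dsl.net
# 0.0.0.0 0001.2waky.com
# ::1 0001.2waky.com
```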

Probable improvement.

Hi, have you considered constructing the domains.txt file without specifying the IP address and thus making dnsmasq return NXDOMAIN?

What pros and cons do you see here?
My list is like the following:
Pros:

  • shortens the error path for applications that would otherwise receive a false IP address, i.e. no further connection attempts.

Cons:

  • hostnames.txt are still to return IPs, i.e. two different results of the two files
  • (haven't checked) old versions of dnsmasq might not support the record w/o IP
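To make the proposal concrete, here is a small Python sketch that rewrites existing `address=/<domain>/<ip>` lines into the address-less form `address=/<domain>/`, which the dnsmasq man page describes as returning NXDOMAIN for the domain and its subdomains. The function name `to_nxdomain_form` is an assumption, and support for this form in old dnsmasq versions is not verified here.

```python
def to_nxdomain_form(line):
    """Turn address=/<domain>/<ip> into address=/<domain>/ (no address)."""
    line = line.strip()
    if not line.startswith("address=/"):
        return line
    parts = line.split("/")
    # parts looks like ["address=", domain, ..., ip]; drop the trailing address.
    return "/".join(parts[:-1]) + "/"


print(to_nxdomain_form("address=/adhost.net/0.0.0.0"))  # address=/adhost.net/
```

Lines that are not `address=` directives are returned untouched, and applying the function twice gives the same result as applying it once.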
