typpo / ad-detector Goto Github PK
View Code? Open in Web Editor NEWDetects articles with corporate sponsors.
Home Page: http://www.ianww.com/ad-detector/
License: MIT License
Detects articles with corporate sponsors.
Home Page: http://www.ianww.com/ad-detector/
License: MIT License
Firefox was all set to go, but I had to change the basic injection technique because Chrome sandboxed the 'window' variable, which was needed to detect Mashable and Fastcompany.
Now I have Chrome-specific calls in main.js that need to be removed before a Firefox version is ready.
On this page, Ad Detector says the article is paid for by EY. No idea what EY is and I don't see any fine print indicating that it's a paid piece. http://www.forbes.com/sites/hollieslade/2014/08/21/how-this-man-built-a-3m-business-a-year-on-from-four-years-in-prison/ (The site shows an ad the first time you visit, just load the page a second time to see the article.)
Firefox detects AdDetector to be incompatible with multiprocess (e10s). See https://developer.mozilla.org/en-US/Add-ons/Working_with_multiprocess_Firefox.
Digiday.com is labeled as a native ad. While perhaps not entire untrue, it's still not consistent with the theme you set for the plug-in.
I've got 0.1.12 installed in Chrome. These URLs are all flagging as sponsored:
http://www.theatlantic.com/
http://www.theatlantic.com/404meplease
http://www.theatlantic.com/technology/archive/2014/08/i-didnt-expect-to-find-pornography-in-my-9-year-olds-web-history/375896/?single_page=true
http://www.theatlantic.com/technology/archive/2014/08/i-didnt-expect-to-find-pornography-in-my-9-year-olds-web-history/375896/
I've got a bunch of other extensions so I'm not positive if this is an AdDetector problem alone.
Many thanks, love the tool!
A web extension port is planned?
Examples:
It seems the first two can be determined based on the URL (sponsored-special
/promo
), but the third one can't be - there's an "Advertorial" badge on the page, though.
http://recode.net/sponsored-content/the-future-of-content-its-a-journey-not-a-destination/
http://studioatgawker.kinja.com/your-wimpy-workout-has-nothing-on-houdinis-bizarre-trai-1613321041
http://www.bizjournals.com/houston/bizwomen/channels/pnc/2014/08/for-businesses-green-is-here-to-stay.html
http://thoughtcatalog.com/thought-catalog-sponsored-posts/2014/07/23-incredible-summer-photos-that-will-make-you-book-your-vacation-tomorrow/
http://gearpatrol.com/2014/08/04/nobody-wins-counterfeit-batteries/
http://tribtalk.org/2014/07/30/fixing-whats-wrong-with-testing-in-k-12-education/
http://www.newrepublic.com/article/118883/millennials-dont-want-climb-traditional-career-ladder
http://www.msnbc.com/msnbc/landfill-fuels-frances-energy-future
http://www.politico.com/magazine/sponsor-content/2014/07/to-save-detroit-start-small/#.U9fGvoBdVv0
http://www.adweek.com/news/technology/how-cnn-and-cisco-made-web-series-about-progressive-cities-158983
http://www.phillymag.com/sponsor-content/grilling-gets-healthy-makeover/
http://www.xojane.com/we-were-paid-for-this/my-favorite-things-lasvegas-day
http://gothamist.com/2014/06/16/opposites_attract_the_best_nyc_date_1.php
http://arstechnica.com/sponsored/it-consumption-model-interactive-infographic/
http://www.thedailybeast.com/articles/2014/05/29/will-you-choose-a-conflict-free-microprocessor.html
http://www.apartmenttherapy.com/when-to-spend-and-when-to-save-sauder-202896
http://www.bloomberg.com/native/article/?mvi=8a0a327343df4ca0b503d03a93deebd2
http://www.thestar.com/sponsored_sections/dailyhealth.html
https://www.yahoo.com/tech/spark-ignites-match-how-sam-yagan-became-the-dominant-a81624092647.html
http://www.bostonmagazine.com/sponsor-content/healthy/
http://www.foodrepublic.com/2014/05/15/whimsical-grilling-leif-hedendal
http://www.vice.com/ketelone/modern-craftsmen-gregory-laketek
http://www.marketwatch.com/ad/article/narratives?prx_t=RpABA5q0BAICAFA&prx_q=8163
http://www.today.com/home/chip-it-sherwin-williams-1D79856708
http://www.foodnetwork.com/recipes/orange-sorbet-recipe.html?COUPON=07-0094-01&bid=847241
http://business.financialpost.com/2013/06/26/a-joint-venture-with-capp-oil-industry-set-to-grow/?__lsa=4d92-1f6b
http://www.canadianfamily.ca/kids/baby/your-guide-to-baby-massage-sponsored/
http://www.inc.com/theupsstore/index.html
http://www.outsideonline.com/adventure-travel/travel-articles/2014-Adventure-Bucket-List.html
http://readwrite.com/2014/03/14/api-explainer-intel#awesm=~oDCS7ICV27l5BC
http://www.upworthy.com/30-seconds-of-women-over-apologizing-followed-by-30-seconds-of-them-so-destroying-that-stereotype?c=upw1&utm_content=bufferad514&utm_medium=social&utm_source=twitter.com&utm_campaign=buffer
http://pioneerlocal.suntimes.com/sponsored/?post=1486-What%20are%20the%20possible%20causes%20of%20your%20ankle%20pain%3F&sponsor=342412-Hinsdale%20Foot%20%26%20Ankle
http://www.dnainfo.com/new-york/sponsor-story/?mvi=8ca5f068f5b249b88f359b7dcc8ca6a0
http://www.arlnow.com/2014/07/11/your-beermonger-comparing-beer-prices-2005-to-today/
http://mic.com/articles/96860/two-engineers-have-created-the-doll-every-young-girl-should-be-playing-with
http://qz.com/241614/reducing-indias-dependence-on-foreign-oil-and-gas/
http://www.oracle.com/us/corporate/profit/partner-news/050814-informatica-2202772.html
http://www.mymodernmet.com/7-Artists-Create-Intricate-Works-Assembling-Thousands-Pieces
http://www.vox.com/sponsored/goldman-sachs-naes/episode-1-the-north-american-energy-opportunity
http://talkingpointsmemo.com/idealab-impact/medicare-partd-affordable-drug-coverage
The extension should occasionally retrieve updated rules from a server.
Some people may not like the idea of a server updating their rules arbitrarily, so this will have to be done carefully and correctly.
Are there plans to add a Safari Plugin as well?
/advertorial/
The homepage for Gawker throws the plugin's banner.
Similar to rtlnieuws.nl
, but in this case the sponsor is clearly listed on the page.
Example: http://www.rtlz.nl/sponsored-special/sneller-diagnose-dankzij-toepassing-dvd-technologie
See the above url for a false positive.
The repository appears to contain version 0.0.20, but the version in the Chrome store is stuck on 0.0.19. Is this intended?
ad-detector marks this article as sponsored by CenturyLink,
http://www.forbes.com/sites/robertglatter/2013/06/21/the-dangers-of-smoking-alcohol/
even though the only mention of centurylink in the html source is in the link to another article in the rotating header. Unless ad-detector knows something that is not shown on the page, this is wrong.
Another important area to increase awareness of ads is entertainment media. Let's also focus on adding a few large entertainment publications.
"Candidate Running for BART Board Buys Bold Italic Journalist's ‘Endorsement’".
_Clarification:_ Bold Italic Editor Jennifer Maerz said she wants to make it clear the Bold Italic was not making an endorsement, rather providing sponsored content[.]
Well that clears everything up.
Just adding it here as a to-do; I haven’t had the time to write the script to detect sponsored content, but clearly this is a perfect example of a site that needs Ad Detector to put a big, fat warning on it.
Ad detector says everything on the Daily Beast is an Ad.
http://www.thedailybeast.com/
We have rules for most, but not all, from this list:
https://en.wikipedia.org/wiki/List_of_newspapers_in_the_United_States_by_circulation
I think we also need more rules for online-only media sources (eg. techcrunch, zdnet, cnet).
This will allow users to submit pages that are currently not recognized.
Example URL: https://motherboard.vice.com/read/the-swedish-fish-theory
Not sure how to detect this one, as it appears to be a manually added image. I don't know whether they have any other sponsored items, either.
eom
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.