GithubHelp home page GithubHelp logo

copy05 / copyrightarmor Goto Github PK

View Code? Open in Web Editor NEW
1.0 1.0 0.0 54 KB

Fighting against piracy on the web

Python 100.00%
anti-piracy copyright-protection copyright-scan dmca dmca-takedown fighting-piracy copy05 content-management content-matching takedown

copyrightarmor's Introduction

CopyrightArmor

The only Open Source Tool to fight against piracy.

CopyrightArmor is a tool that scans and detects for pirated content for special research or to takedown. We aim for decentralisation and independence of DMCA Reporting Agencies.

Features

  • Detecting Stolen Images: CopyrightArmor detects stolen images by matching sha256 "hashes"
  • Detecting Stolen Files: CopyrightArmor detects stolen files by matching sha256 "hashes"
  • Web Scraping Engine: CopyrightArmor uses a robust web scraping engine to crawl and analyze websites for potentially infringing content.
  • Recursive Scrapping: Recursively scans websites.
  • Google Optimized: by using --google and --google-search without -url the web scraping engine will be optimized for Google Search.
  • Google SERP Engine: CopyrightArmor detects most of pirate sites on the Google Search results. (always check for false positives)
  • Flexible: You can configurate how and what type of links it should scrape:
    • Exclude Social Media Links
    • Exclude Query parameter links
    • Exclude External Links
  • Generate DMCA Takedown Letters: by using create-report.py inside the report directory.
  • Detailed Reports: Generate detailed reports of scanned URLs and domain statistics to keep records of your scanning activities:
CopyrightArmor 10/30/23 Report

Scanned URLs (5):
https://www.iana.org/domains/example
https://example.com
https://www.iana.org/protocols
https://www.iana.org/domains
https://www.iana.org/

Scanned Domains (2):
www.iana.org - 4 URLs
example.com - 1 URLs

Installations

  1. Clone the CopyrightArmor repository from GitHub:
git clone https://github.com/Copy05/CopyrightArmor.git
  1. Navigate to the project directory:
cd CopyrightArmor
  1. Install the required dependencies using pip:
pip install -r requirements.txt
  1. Open up src/hashes.json and add all content that you want to scan for and use this syntax:

--google-search scans only for the content inside "entertainment"

{
    "images": [
        {
            "original_url": "https://t3.ftcdn.net/jpg/05/41/71/02/360_F_541710260_3qZwn627nHyHA99xIktU7tKotn93YGjw.jpg",
            "copyright_owner": "Example Corp.",
            "hash": "3b99f49776f433aeb000fa010e452879198fe4be7f6660552527b53304268342",
            "description": "A Pink Cherry Tree"
        },
    ],
    "files": [
        {
            "original_url": "https://t3.ftcdn.net/jpg/05/41/71/02/361_F_541710260_3qZwn627nHyHA99xIktU7tKotn93YGjw.exe",
            "copyright_owner": "Example Corperation.",
            "hash": "3b99f49776f433aeb000fa010e452879198fe4be7f6660552527b53304268343",
            "description": "Another Pink Cherry Tree Exe"
        }
    ],
    "entertainment": [
        {
            "title": "EntertainRewind 2024",
            "original_url": "example.com",
            "copyright_owner": "Entertainmasters",
            "hash": "c8392dc67d913d39664e0fc400280a2de03107348f7432e226194f0a7e4eeebe"
        }
    ]
}

Contribution

I do accept contributions to this project. create a pull request and describe your changes !

Most Targetted Copyright Owners

To see how much piracy happened on the internet here is a graph:

Catagory URLs
MG Premium LTD (Aylo) +1,092,671,418
Music +742,100,209
Movies / Television +741,019,179
Webtoons +711,855,406
Anime +443,879,761
Manga +195,188,170
Models +167,086,838
XXX +11,971,422
WGCZ (BangBros) +11,355,801

copyrightarmor's People

Contributors

copy05 avatar snyk-bot avatar

Stargazers

 avatar

Watchers

 avatar

copyrightarmor's Issues

`depth` dosn't work like intended

It'll be a easy fix.

# There should be a explicit check for if depth reached 0. it will exit (SystemExit)
if depth is not None and depth <= 0:
        return
    
    if depth is not None:
        depth -= 1

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.