A copyright violation detector running on Wikimedia Cloud Services https://tools.wmflabs.org/copyvios/
Ben Kurtovic a07d12699a Finally fix #3; speed up highlighter with a deque. 10 years ago
copyvios Finally fix #3; speed up highlighter with a deque. 10 years ago
logs Always have a log dir. 10 years ago
static Adjust indent size; fix rendering of strings. 10 years ago
templates Note that PDF parsing is now supported. 10 years ago
.gitignore Fix gitignore for logs. 10 years ago
.lighttpd.conf Begin conversion to Flask; updates. 10 years ago
LICENSE Updates. 10 years ago
README.md Store what was previously in SQL tables in flask.g. 10 years ago
app.fcgi Cleanup; call update_sites() before API requests. 10 years ago
build.py Begin conversion to Flask; updates. 10 years ago

README.md

This is a copyright violation detector running on Wikimedia Labs.

It can search the web for content similar to a given article, and graphically compare an article to a specific URL.
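
To give a rough idea of what "similar content" can mean, here is a minimal sketch that scores how much of an article's wording also appears in a source text. It is only an illustration and is not taken from this tool's code: the function names `word_ngrams` and `overlap_score` are hypothetical, and word trigram overlap is an assumed stand-in for whatever comparison the detector actually performs.

```python
import re


def word_ngrams(text, n=3):
    """Return the set of lowercase word n-grams found in text."""
    words = re.findall(r"\w+", text.lower())
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}


def overlap_score(article, source, n=3):
    """Return the fraction of the article's n-grams that also occur in source.

    Hypothetical helper for illustration; not the detector's real metric.
    """
    article_grams = word_ngrams(article, n)
    if not article_grams:
        # Texts shorter than n words have nothing to compare.
        return 0.0
    shared = article_grams & word_ngrams(source, n)
    return len(shared) / len(article_grams)


# Placeholder texts, not real article or web content.
article = "The quick brown fox jumps over the lazy dog"
source = "A quick brown fox jumps over a sleeping cat"
score = overlap_score(article, source)
```

A score near 1 would mean that most of the article's three-word sequences also occur in the source, while a score near 0 would mean almost none do.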

Dependencies

Running

  • Install all dependencies listed above. You might want to use a virtualenv.

  • Create an SQL database with the cache and cache_data tables defined by earwigbot-plugins.

  • Create an earwigbot instance in .earwigbot (run earwigbot .earwigbot). In .earwigbot/config.yml, fill out the connection info for the database by adding the following to the wiki section:

      _copyviosSQL:
          host: <hostname of database server>
          db:   <name of database>
    

    If oursql.connect() needs additional arguments, such as a username or password, add them to the _copyviosSQL section as well.

  • Copy .lighttpd.conf to the relevant location (on Tool Labs, this is in the root of the project’s home directory) and adjust its contents as necessary.

  • Run ./build.py to minify JS and CSS files.

  • Adjust the hashbang in app.fcgi to point to the correct Python interpreter or virtual environment.

  • Start lighttpd (on Tool Labs, webservice start).
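
As a rough illustration of the database step above, the following sketch shows one way the `_copyviosSQL` section could be turned into keyword arguments for `oursql.connect()`. It is written under assumptions and does not reproduce the tool's own code: `build_connect_args` is a hypothetical helper, the configuration is shown as an already-parsed dictionary, all values are placeholders, and `user` and `passwd` are assumed to be the keyword names used for credentials.

```python
def build_connect_args(wiki_config):
    """Return keyword arguments for a database connect call.

    wiki_config is the parsed "wiki" section of config.yml as a dict.
    Hypothetical helper for illustration only.
    """
    try:
        section = wiki_config["_copyviosSQL"]
    except KeyError:
        raise ValueError("the wiki section has no _copyviosSQL entry")
    # host and db are the two keys the setup instructions ask for.
    missing = [key for key in ("host", "db") if key not in section]
    if missing:
        raise ValueError("_copyviosSQL is missing: " + ", ".join(missing))
    # Copy so that the caller's configuration is not modified later.
    return dict(section)


# Placeholder values, not real connection details.
wiki_config = {
    "_copyviosSQL": {
        "host": "db.example.org",
        "db": "copyvios_cache",
        "user": "example_user",      # extra argument (assumed keyword name)
        "passwd": "example_secret",  # extra argument (assumed keyword name)
    }
}
connect_args = build_connect_args(wiki_config)
# With oursql installed, the connection would then be opened roughly as:
#     conn = oursql.connect(**connect_args)
```

Checking for `host` and `db` up front means a missing key is reported when the configuration is read, before any connection attempt is made.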