A copyright violation detector running on Wikimedia Cloud Services https://tools.wmflabs.org/copyvios/
Ben Kurtovic fa181ad15b Link to some technical details. 10 years ago
copyvios Finally fix #3; speed up highlighter with a deque. 10 years ago
logs Always have a log dir. 10 years ago
static Adjust indent size; fix rendering of strings. 10 years ago
templates Note that PDF parsing is now supported. 10 years ago
.gitignore Fix gitignore for logs. 10 years ago
.lighttpd.conf Begin conversion to Flask; updates. 10 years ago
LICENSE Updates. 10 years ago
README.md Link to some technical details. 10 years ago
app.fcgi Cleanup; call update_sites() before API requests. 10 years ago
build.py Begin conversion to Flask; updates. 10 years ago

README.md

This is a copyright violation detector running on Wikimedia Labs.

It can search the web for content similar to a given article, and graphically compare an article to a specific URL. Some technical details are expanded upon in a blog post.

Dependencies

Running

  • Install all dependencies listed above. You might want to use a virtualenv.

  • Create an SQL database with the cache and cache_data tables defined by earwigbot-plugins.

  • Create an earwigbot instance in .earwigbot (run earwigbot .earwigbot). In .earwigbot/config.yml, fill out the connection info for the database by adding the following to the wiki section:

      _copyviosSQL:
          host: <hostname of database server>
          db:   <name of database>
    

    If oursql.connect() needs additional arguments, such as a username or password, add them to the _copyviosSQL section as well.

  • Copy .lighttpd.conf to the relevant location (on Tool Labs, this is in the root of the project’s home directory) and adjust its contents as necessary.

  • Run ./build.py to minify JS and CSS files.

  • Adjust the hashbang in app.fcgi to point to the correct Python interpreter or virtual environment.

  • Start lighttpd (on Tool Labs, webservice start).
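
The database step above says that extra keys in the _copyviosSQL section are meant for oursql.connect(). A minimal sketch of that idea follows, assuming the section is simply unpacked into keyword arguments. The helper name build_connect_kwargs, the example host and database values, and the user and passwd key names are illustrative assumptions, not taken from this project; check the oursql documentation for the argument names it actually accepts.

```python
def build_connect_kwargs(wiki_config):
    """Return a copy of the _copyviosSQL section for use as connection kwargs.

    `wiki_config` stands in for the parsed `wiki` section of
    .earwigbot/config.yml (a plain dict here, so no YAML parser is needed).
    """
    section = wiki_config.get("_copyviosSQL")
    if section is None:
        raise KeyError("missing _copyviosSQL section in the wiki config")
    # host and db are the two keys the README asks for explicitly.
    for required in ("host", "db"):
        if required not in section:
            raise KeyError("_copyviosSQL is missing %r" % required)
    # Copy the section so callers cannot mutate the loaded configuration.
    return dict(section)


# Hypothetical configuration; every value below is a placeholder.
wiki_config = {
    "_copyviosSQL": {
        "host": "db.example.org",
        "db": "copyvios_cache",
        "user": "example_user",        # assumed key name
        "passwd": "example_password",  # assumed key name
    }
}

kwargs = build_connect_kwargs(wiki_config)
# With oursql installed, the connection would then be opened roughly as:
# conn = oursql.connect(**kwargs)
```

Keeping the whole section as one dict means any further connection option can be added in config.yml without touching code, which matches the instruction to put additional arguments in the same section.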