A copyright violation detector running on Wikimedia Cloud Services https://tools.wmflabs.org/copyvios/
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

README.md 2.0 KiB

11 years ago
13 years ago
10 years ago
10 years ago
10 years ago
10 years ago
10 years ago
12345678910111213141516171819202122232425262728293031323334353637383940414243444546474849
  1. This is a [copyright violation](https://en.wikipedia.org/wiki/WP:COPYVIO)
  2. detector running on [Wikimedia Labs](https://tools.wmflabs.org/copyvios).
  3. It can search the web for content similar to a given article, and graphically
  4. compare an article to a specific URL. Some technical details are expanded upon
  5. [in a blog post](http://benkurtovic.com/2014/08/20/copyvio-detector.html).
  6. Dependencies
  7. ============
  8. * [earwigbot](https://github.com/earwig/earwigbot) >= 0.1
  9. * [flask](http://flask.pocoo.org/) >= 0.10.1
  10. * [flask-mako](https://pythonhosted.org/Flask-Mako/) >= 0.3
  11. * [mako](http://www.makotemplates.org/) >= 0.7.2
  12. * [mwparserfromhell](https://github.com/earwig/mwparserfromhell) >= 0.3
  13. * [oursql](http://packages.python.org/oursql/) >= 0.9.3.1
  14. * [requests](http://python-requests.org/) >= 2.9.1
  15. * [SQLAlchemy](http://sqlalchemy.org/) >= 0.9.6
  16. * [apsw](https://github.com/rogerbinns/apsw) >= 3.26.0
  17. * [uglifycss](https://github.com/fmarcia/UglifyCSS/)
  18. * [uglifyjs](https://github.com/mishoo/UglifyJS/) >= 1.3.3
  19. Running
  20. =======
  21. - If using Tool Labs, you should clone the repository to `~/www/python/src`, or
  22. otherwise symlink it to that directory. A
  23. [virtualenv](http://virtualenv.readthedocs.org/) should be created at
  24. `~/www/python/venv`.
  25. - Install all dependencies listed above.
  26. - Create an SQL database with the `cache` and `cache_data` tables defined by
  27. [earwigbot-plugins](https://github.com/earwig/earwigbot-plugins/blob/develop/tasks/schema/afc_copyvios.sql).
  28. - Create an earwigbot instance in `.earwigbot` (run `earwigbot .earwigbot`). In
  29. `.earwigbot/config.yml`, fill out the connection info for the database by
  30. adding the following to the `wiki` section:
  31. _copyviosSQL:
  32. host: <hostname of database server>
  33. db: <name of database>
  34. If additional arguments are needed by `oursql.connect()`, like usernames or
  35. passwords, they should be added to the `_copyviosSQL` section.
  36. - Run `./build.py` to minify JS and CSS files.
  37. - Start the web server (on Tool Labs, `webservice2 uwsgi-python start`).