A copyright violation detector running on Wikimedia Cloud Services https://tools.wmflabs.org/copyvios/
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

README.md 2.1 KiB

10 years ago
3 years ago
10 years ago
3 years ago
13 years ago
3 years ago
3 years ago
3 years ago
3 years ago
1234567891011121314151617181920212223242526272829303132333435363738394041424344454647484950
  1. This is a [copyright violation](https://en.wikipedia.org/wiki/WP:COPYVIO)
  2. detector running on [Wikimedia Cloud Services](https://copyvios.toolforge.org/).
  3. It can search the web for content similar to a given article, and graphically
  4. compare an article to a specific URL. Some technical details are expanded upon
  5. [in a blog post](https://benkurtovic.com/2014/08/20/copyvio-detector.html).
  6. Dependencies
  7. ============
  8. * [earwigbot](https://github.com/earwig/earwigbot) >= 0.1
  9. * [flask](https://flask.palletsprojects.com/) >= 0.10.1
  10. * [flask-mako](https://pythonhosted.org/Flask-Mako/) >= 0.3
  11. * [mako](https://www.makotemplates.org/) >= 0.7.2
  12. * [mwparserfromhell](https://github.com/earwig/mwparserfromhell) >= 0.3
  13. * [oursql](https://pythonhosted.org/oursql/) >= 0.9.3.1
  14. * [requests](https://requests.readthedocs.io/) >= 2.9.1
  15. * [SQLAlchemy](https://www.sqlalchemy.org/) >= 0.9.6
  16. * [apsw](https://github.com/rogerbinns/apsw) >= 3.26.0
  17. * [uglifyjs](https://github.com/mishoo/UglifyJS) >= 3.12.6
  18. * [cssnano](https://github.com/cssnano/cssnano) >= 4.1.10
  19. * [postcss-cli](https://github.com/postcss/postcss-cli) >= 8.3.1
  20. Running
  21. =======
  22. - If using Toolforge, you should clone the repository to `~/www/python/src`, or
  23. otherwise symlink it to that directory. A
  24. [virtualenv](https://virtualenv.pypa.io/) should be created at
  25. `~/www/python/venv`.
  26. - Install all dependencies listed above.
  27. - Create an SQL database with the `cache` and `cache_data` tables defined by
  28. [earwigbot-plugins](https://github.com/earwig/earwigbot-plugins/blob/develop/tasks/schema/afc_copyvios.sql).
  29. - Create an earwigbot instance in `.earwigbot` (run `earwigbot .earwigbot`). In
  30. `.earwigbot/config.yml`, fill out the connection info for the database by
  31. adding the following to the `wiki` section:
  32. _copyviosSQL:
  33. host: <hostname of database server>
  34. db: <name of database>
  35. If additional arguments are needed by `oursql.connect()`, like usernames or
  36. passwords, they should be added to the `_copyviosSQL` section.
  37. - Run `./build.py` to minify JS and CSS files.
  38. - Start the web server (on Toolforge, `webservice uwsgi-python start`).