Ben Kurtovic
0692227496
Cleanup/tweaks, mainly to rendering.
9 år sedan
Ben Kurtovic
6d940b5c92
ast.literal_eval -> json.loads
9 år sedan
Ben Kurtovic
83f5588354
Disable turnitin by default for now, until we confirm it works as expected.
9 år sedan
Frances Hocutt
4e994f1302
Refactor turnitin.py, incorporate diff link/timestamp
* Add a wiki timestamp parser to copyvios/misc.py
* Refactor copyvios/turnitin.py for more sensible structure
* Update templates/index.mako to incorporate diff link/timestamp and
make it clearer that Turnitin is revision-based checking
9 år sedan
Frances Hocutt
1ffa87da0b
Improve turnitin.py docstrings, fix bugs
9 år sedan
Frances Hocutt
bf0aa22fa9
[WIP] improve docstrings and naming, mark TODO
9 år sedan
Frances Hocutt
efe2300250
[WIP] Basic working integration of turnitin
Add a checkbox to allow searching the EranBot/plagiabot database for
Turnitin results, and display them in a similar form to the on-wiki
EranBot reports if they exist.
Add a new module (copyvios/turnitin.py) to handle fetching and parsing
the EranBot results.
Bug: https://phabricator.wikimedia.org/T110144
TODO: tweak display HTML/CSS; refactor/clean up turnitin.py;
improve dev set-up so it doesn't always default to testwiki
and can test without hardcoding page title
9 år sedan
Ben Kurtovic
ad033a6e2b
Pass excluded URL data to the API.
9 år sedan
Ben Kurtovic
321e6e3352
Support new 'excluded' column in database.
9 år sedan
Ben Kurtovic
4092cbc0f6
Keep all workers' backgrounds in sync, and ensure they update at 00:00 UTC.
9 år sedan
Ben Kurtovic
b325ef84cd
Support general/update notices.
9 år sedan
Ben Kurtovic
1327f57404
Fix backgrounds when the image has been moved.
9 år sedan
Ben Kurtovic
f2c1d5b853
Make cache.langs, cache.projects sorted lists.
10 år sedan
Ben Kurtovic
0f4ecfe80a
Auto-bypass cache when cache_possible_miss is True.
10 år sedan
Ben Kurtovic
81552f28d3
Bugfix; missed column.
10 år sedan
Ben Kurtovic
10e75c7b7a
Cache 'possible_miss' result value.
10 år sedan
Ben Kurtovic
331b05baae
Truncate URLs above 1024 chars.
10 år sedan
Ben Kurtovic
a07d12699a
Finally fix #3 ; speed up highlighter with a deque.
10 år sedan
Ben Kurtovic
f4ff4099be
Fix old reference to query.all_projects.
10 år sedan
Ben Kurtovic
a1d8b79265
Fix recursion bug.
10 år sedan
Ben Kurtovic
6eed95c761
Use a global cache instead of the request-specific one.
10 år sedan
Ben Kurtovic
ddcf2a8963
Store what was previously in SQL tables in flask.g.
10 år sedan
Ben Kurtovic
9b07a5e9dc
OrderedDict requires extra parens.
10 år sedan
Ben Kurtovic
340ba07c9d
Generate API results as ordered dicts.
10 år sedan
Ben Kurtovic
6d87ad41ed
Rearrange functions in api.py.
10 år sedan
Ben Kurtovic
2fad6b0b60
Description of API response data.
10 år sedan
Ben Kurtovic
89feddba98
Some minor tweaks, plus documentation for API parameters.
10 år sedan
Ben Kurtovic
6639338fa8
First version of new API (for #7 )
10 år sedan
Ben Kurtovic
6f574e37a2
Pluralize cache times correctly.
10 år sedan
Ben Kurtovic
db7a4f2ec4
Clean up query data handling.
10 år sedan
Ben Kurtovic
b3e1862726
Catch and display search errors more usefully.
10 år sedan
Ben Kurtovic
b7bafb29cf
Implement noskip.
10 år sedan
Ben Kurtovic
73315e3370
FML.
10 år sedan
Ben Kurtovic
5242d31a93
Whoops.
10 år sedan
Ben Kurtovic
0731eb089d
Cleanup; fix confidence value comparison; fix.
10 år sedan
Ben Kurtovic
7cd9a53209
Fix oursql coercing binary data to unicode.
10 år sedan
Ben Kurtovic
b2fba1fa56
Implement new caching system and result structure.
10 år sedan
Ben Kurtovic
c6b9c9ebe6
Really fix checkbox blank value treatment.
10 år sedan
Ben Kurtovic
6cd194f971
Catch empty method error; fix blank values in query.
10 år sedan
Ben Kurtovic
b90e5e7279
Fix logic.
10 år sedan
Ben Kurtovic
c165e97fb2
Handle empty URLs from cached results correctly.
10 år sedan
Ben Kurtovic
710447a6bf
Slightly modified interface; give more options ( closes #17 ).
10 år sedan
Ben Kurtovic
8f6a02a8f3
If no scheme is given for the URL, add http://
10 år sedan
Ben Kurtovic
8d80eccad0
T_POSSIBLE back down to 0.4.
10 år sedan
Ben Kurtovic
490022910f
Update T_POSSIBLE, T_SUSPECT.
10 år sedan
Ben Kurtovic
9e13ac1c04
Increase log message verbosity; fix import.
10 år sedan
Ben Kurtovic
ce96f0f8a6
Use a yellow 'possible' box for confidence between 0.3 and 0.6 ( closes #5 ).
10 år sedan
Ben Kurtovic
75b627cfa5
Refactor out page.copyvio_compare() and surrounding code.
10 år sedan
Ben Kurtovic
cf66804fe5
Give a different error message for timeouts. Cheating a bit.
10 år sedan
Ben Kurtovic
7d426a4eb0
Support following redirects ( closes #6 ).
10 år sedan