Ben Kurtovic
b134ef67ba
Even better error handling
3 years ago
Ben Kurtovic
c829590bc9
Bugfix
3 years ago
Ben Kurtovic
ad87596e37
Better error handling for revision IDs
3 years ago
Ben Kurtovic
eacb294735
Update URLs
4 years ago
Ben Kurtovic
4cebe36eee
Validate site info if not in sites DB
5 years ago
Ben Kurtovic
96ac14ec38
Use rvslots for fetching revision content
5 years ago
Ben Kurtovic
c73179b28c
Update highlighter for new MarkovChain implementation
5 years ago
Ben Kurtovic
01cd6dc763
Make caching advisory and reduce db lock duration
5 years ago
Ben Kurtovic
8321a07ba9
More additions for sqlite support
5 years ago
Ben Kurtovic
ca58aac413
Add sqlite3 backend to replace mysql
5 years ago
Ben Kurtovic
29861c1118
Revert previous; literal_eval is necessary here. Add error handling.
6 years ago
Ben Kurtovic
a63e4e2236
Safer parsing for Turnitin API result.
6 years ago
Ben Kurtovic
79866da842
Strip whitespace.
7 years ago
Ben Kurtovic
088dfe6197
Again, fix link and cleanup.
7 years ago
Ben Kurtovic
21aad51375
Fix unnecessary extra work.
7 years ago
Ben Kurtovic
63f1ae98c4
Show warning when a page has an attribution template.
7 years ago
Ben Kurtovic
a2df4bba0a
Tweak parameters.
8 years ago
Ben Kurtovic
1076959f71
Bugfix in highlight_delta() call; clarify docs.
8 years ago
Ben Kurtovic
2a81217de8
Add support for detailed text comparison in API (T132949)
8 years ago
Ben Kurtovic
8be5c63bc1
Turnitin bugfix.
8 years ago
Ben Kurtovic
137c33bfcb
Fix _format_date() for caches older than a day.
8 years ago
Ben Kurtovic
e6e83ed73c
Revert "ast.literal_eval -> json.loads"
This reverts commit 6d940b5c92.
8 years ago
Ben Kurtovic
0692227496
Cleanup/tweaks, mainly to rendering.
8 years ago
Ben Kurtovic
6d940b5c92
ast.literal_eval -> json.loads
8 years ago
Ben Kurtovic
83f5588354
Disable turnitin by default for now, until we confirm it works as expected.
8 years ago
Frances Hocutt
4e994f1302
Refactor turnitin.py, incorporate diff link/timestamp
* Add a wiki timestamp parser to copyvios/misc.py
* Refactor copyvios/turnitin.py for more sensible structure
* Update templates/index.mako to incorporate diff link/timestamp and
make it clearer that Turnitin is revision-based checking
9 years ago
Frances Hocutt
1ffa87da0b
Improve turnitin.py docstrings, fix bugs
9 years ago
Frances Hocutt
bf0aa22fa9
[WIP] improve docstrings and naming, mark TODO
9 years ago
Frances Hocutt
efe2300250
[WIP] Basic working integration of turnitin
Add a checkbox to allow searching the EranBot/plagiabot database for
Turnitin results, and display them in a similar form to the on-wiki
EranBot reports if they exist.
Add a new module (copyvios/turnitin.py) to handle fetching and parsing
the EranBot results.
Bug: https://phabricator.wikimedia.org/T110144
TODO: tweak display HTML/CSS; refactor/clean up turnitin.py;
improve dev set-up so it doesn't always default to testwiki
and can test without hardcoding page title
9 years ago
Ben Kurtovic
ad033a6e2b
Pass excluded URL data to the API.
9 years ago
Ben Kurtovic
321e6e3352
Support new 'excluded' column in database.
9 years ago
Ben Kurtovic
4092cbc0f6
Keep all workers' backgrounds in sync, and ensure they update at 00:00 UTC.
9 years ago
Ben Kurtovic
b325ef84cd
Support general/update notices.
9 years ago
Ben Kurtovic
1327f57404
Fix backgrounds when the image has been moved.
9 years ago
Ben Kurtovic
f2c1d5b853
Make cache.langs, cache.projects sorted lists.
10 years ago
Ben Kurtovic
0f4ecfe80a
Auto-bypass cache when cache_possible_miss is True.
10 years ago
Ben Kurtovic
81552f28d3
Bugfix; missed column.
10 years ago
Ben Kurtovic
10e75c7b7a
Cache 'possible_miss' result value.
10 years ago
Ben Kurtovic
331b05baae
Truncate URLs above 1024 chars.
10 years ago
Ben Kurtovic
a07d12699a
Finally fix #3; speed up highlighter with a deque.
10 years ago
Ben Kurtovic
f4ff4099be
Fix old reference to query.all_projects.
10 years ago
Ben Kurtovic
a1d8b79265
Fix recursion bug.
10 years ago
Ben Kurtovic
6eed95c761
Use a global cache instead of the request-specific one.
10 years ago
Ben Kurtovic
ddcf2a8963
Store what was previously in SQL tables in flask.g.
10 years ago
Ben Kurtovic
9b07a5e9dc
OrderedDict requires extra parens.
10 years ago
Ben Kurtovic
340ba07c9d
Generate API results as ordered dicts.
10 years ago
Ben Kurtovic
6d87ad41ed
Rearrange functions in api.py.
10 years ago
Ben Kurtovic
2fad6b0b60
Description of API response data.
10 years ago
Ben Kurtovic
89feddba98
Some minor tweaks, plus documentation for API parameters.
10 years ago
Ben Kurtovic
6639338fa8
First version of new API (for #7)
10 years ago