Ben Kurtovic
|
9ffc3f1bf5
|
Raise file crawl size limit for PDFs.
|
10 lat temu |
Ben Kurtovic
|
33107c6a01
|
Add a lastrevid property to pages.
|
10 lat temu |
Ben Kurtovic
|
b87d5ac673
|
Pass parameter to recursive call.
|
10 lat temu |
Ben Kurtovic
|
170f810735
|
Allow ExclusionDB to force a sync.
|
10 lat temu |
Ben Kurtovic
|
86f8a6c4f9
|
Implement !cancel, !unremind, and !forget for reminders.
|
10 lat temu |
Ben Kurtovic
|
6ae3cd6d08
|
Handle interwiki page titles correctly.
|
10 lat temu |
Ben Kurtovic
|
901192ec18
|
Handle errors from UnicodeDamnit.
|
10 lat temu |
Ben Kurtovic
|
3f2dd1094f
|
Catch HTTPException in opener.open.
|
10 lat temu |
Ben Kurtovic
|
9eaad11efb
|
Fix unicode bug in exception.
|
10 lat temu |
Ben Kurtovic
|
699f6e3b17
|
Seems it will sometimes raise AssertionError.
|
10 lat temu |
Ben Kurtovic
|
12c5170815
|
Catch another exception thrown by pdfminer.
|
10 lat temu |
Ben Kurtovic
|
08d02917f2
|
Strange typo.
|
10 lat temu |
Ben Kurtovic
|
c2a5946874
|
Fix generating -0.0 as a confidence value.
|
10 lat temu |
Ben Kurtovic
|
106e58b164
|
Update confidence function comments.
|
10 lat temu |
Ben Kurtovic
|
b8d55973c9
|
Tell Yahoo! it's okay to return PDFs.
|
10 lat temu |
Ben Kurtovic
|
5194525a32
|
Note when sources might have been missed.
|
10 lat temu |
Ben Kurtovic
|
065d9ea498
|
Fix; should always return a float.
|
10 lat temu |
Ben Kurtovic
|
290f81abed
|
Prevent -0.0 from being a confidence value.
|
10 lat temu |
Ben Kurtovic
|
932b93572a
|
Simplify function.
|
10 lat temu |
Ben Kurtovic
|
80d2735716
|
Bugfix.
|
10 lat temu |
Ben Kurtovic
|
c964c3af82
|
Better solution to previous commit.
|
10 lat temu |
Ben Kurtovic
|
c4ee35a09d
|
Support case-insensitive language lookups.
|
10 lat temu |
Ben Kurtovic
|
05fc3a3e4a
|
Fix handling of mulilingual defines.
|
10 lat temu |
Ben Kurtovic
|
00474e5962
|
Fix multilingual dict bug.
|
10 lat temu |
Ben Kurtovic
|
d963e13af0
|
Merge pull request #51 from Riamse/patch-1
Improve ultilingual dictionary capabilities
|
10 lat temu |
Riamse
|
d09d5304f9
|
Improve ultilingual dictionary capabilities
Add the ability to search for a definition in a specific language
|
10 lat temu |
Ben Kurtovic
|
07bbf240f6
|
Support new method for query continuation.
|
10 lat temu |
Ben Kurtovic
|
459c252fc7
|
Support new CSRF token API.
|
10 lat temu |
Ben Kurtovic
|
77514ee925
|
Add another PDF string substitution.
|
10 lat temu |
Ben Kurtovic
|
0bdcbca8b0
|
Rudimentary solution for PDF parsing (closes earwig/copyvios#18)
|
10 lat temu |
Ben Kurtovic
|
30f72df470
|
Refactor parsers; fix empty document behavior.
|
10 lat temu |
Ben Kurtovic
|
5349179088
|
Fix parsing of plain text documents (earwig/copyvios#3)
|
10 lat temu |
Ben Kurtovic
|
f10908e34e
|
Handle struct.error from GzipFile.read() (Python bug?)
|
10 lat temu |
Ben Kurtovic
|
693cdc302f
|
Catch errors while searching.
|
10 lat temu |
Ben Kurtovic
|
303c39c8c7
|
Add an option to disable short-circuiting.
|
10 lat temu |
Ben Kurtovic
|
f8f4669460
|
Remove unnecessary key attribute of sources.
|
10 lat temu |
Ben Kurtovic
|
9fd145da5c
|
Add some docs; better sorting function.
|
10 lat temu |
Ben Kurtovic
|
7afb484cea
|
Refactor a bunch of copyvio internals. Store all sources with a result object.
|
10 lat temu |
Ben Kurtovic
|
e88d1c2c70
|
Fix lazy module behavior after failure.
|
10 lat temu |
Ben Kurtovic
|
54ddff049f
|
Make CopyvioSource public; tweaks.
|
10 lat temu |
Ben Kurtovic
|
0438766ee4
|
Handle empty URLs better.
|
10 lat temu |
Ben Kurtovic
|
2147207388
|
Remove unnecessary variable assign.
|
10 lat temu |
Ben Kurtovic
|
f94a67e0e3
|
Define num_queries in the proper place.
|
10 lat temu |
Ben Kurtovic
|
12247dd756
|
Add no_links and no_searches to copyvio_check().
|
10 lat temu |
Ben Kurtovic
|
f37621e5ec
|
Use a deque for a FIFO instead of the python list LIFO.
|
10 lat temu |
Ben Kurtovic
|
8e439e1eea
|
source.join() now blocks when in the middle of processing.
|
10 lat temu |
Ben Kurtovic
|
dbb1ae5483
|
Handle empty queues correctly. Remove some log messages.
|
10 lat temu |
Ben Kurtovic
|
2fa8aeba5b
|
Fix a blocking issue.
|
10 lat temu |
Ben Kurtovic
|
c56838e742
|
Only spawn one worker for comparisons in local mode.
|
10 lat temu |
Ben Kurtovic
|
939d8be08f
|
Fix variable.
|
10 lat temu |