Ben Kurtovic
|
f94a67e0e3
|
Define num_queries in the proper place.
|
10 years ago |
Ben Kurtovic
|
12247dd756
|
Add no_links and no_searches to copyvio_check().
|
10 years ago |
Ben Kurtovic
|
f37621e5ec
|
Use a deque for a FIFO instead of the python list LIFO.
|
10 years ago |
Ben Kurtovic
|
8e439e1eea
|
source.join() now blocks when in the middle of processing.
|
10 years ago |
Ben Kurtovic
|
dbb1ae5483
|
Handle empty queues correctly. Remove some log messages.
|
10 years ago |
Ben Kurtovic
|
2fa8aeba5b
|
Fix a blocking issue.
|
10 years ago |
Ben Kurtovic
|
c56838e742
|
Only spawn one worker for comparisons in local mode.
|
10 years ago |
Ben Kurtovic
|
939d8be08f
|
Fix variable.
|
10 years ago |
Ben Kurtovic
|
3ed8837a3e
|
Fix stopping queues in local mode.
|
10 years ago |
Ben Kurtovic
|
de7576728f
|
Fix dequeueing logic a bit.
|
10 years ago |
Ben Kurtovic
|
b939262b11
|
Bugfix.
|
10 years ago |
Ben Kurtovic
|
32ef0fbf1f
|
Add a bunch of temporary debugging code.
|
10 years ago |
Ben Kurtovic
|
c7b3b7bc7f
|
CopyvioSource.workspace should be public.
|
10 years ago |
Ben Kurtovic
|
e73e626994
|
Some locks needed to be tightened.
|
10 years ago |
Ben Kurtovic
|
486c4692ed
|
Remove _workers attr of workspaces.
|
10 years ago |
Ben Kurtovic
|
7c0e98596c
|
Some bugfixes.
|
10 years ago |
Ben Kurtovic
|
361f7709f8
|
Starting work on global workers.
|
10 years ago |
Ben Kurtovic
|
bdcbfa5327
|
Catch errors around response.read().
|
10 years ago |
Ben Kurtovic
|
9b87e2e5f7
|
Fix trying to remove a node that was already removed.
|
10 years ago |
Ben Kurtovic
|
24dd497fd9
|
Catch more general socket.error.
|
10 years ago |
Ben Kurtovic
|
5e72e74759
|
Employ new piecewise article-delta confidence function.
|
10 years ago |
Ben Kurtovic
|
193f96451e
|
Also strip <ref>s in ArticleTextParser.strip().
|
10 years ago |
Ben Kurtovic
|
c4dede1459
|
Reorder length check to potentially fix an empty-query bug.
|
10 years ago |
Ben Kurtovic
|
203c65280c
|
Float delta.
|
10 years ago |
Ben Kurtovic
|
6b0f8ad311
|
Fix reference.
|
10 years ago |
Ben Kurtovic
|
e2d7c7aef6
|
Update with new confidence function; fix unicode.
|
10 years ago |
Ben Kurtovic
|
05010933c7
|
Reorder some URL opening code; zip protection.
|
10 years ago |
Ben Kurtovic
|
4f5a22a2e5
|
Apparently oauth2 converts the query to unicode.
|
10 years ago |
Ben Kurtovic
|
5003c21ff6
|
Quoting the entire query works now.
|
10 years ago |
Ben Kurtovic
|
5677664476
|
Properly encode URL for the search engine.
|
10 years ago |
Ben Kurtovic
|
5890ee6e6a
|
Don't quote_plus() the query.
|
10 years ago |
Ben Kurtovic
|
2bddf79a3d
|
Fix deadlock when calling queue.put() while holding the mutex.
|
10 years ago |
Ben Kurtovic
|
7a4fcd7807
|
Fix queue clear call.
|
10 years ago |
Ben Kurtovic
|
efae85a1fe
|
Move thread spawning code to worker class.
|
10 years ago |
Ben Kurtovic
|
6a90efc812
|
Improve !threads command output.
|
10 years ago |
Ben Kurtovic
|
7137dda920
|
Update copyvio checker to not make concurrent requests to a single domain.
|
10 years ago |
Ben Kurtovic
|
5874467ec3
|
Bugfix, cleanup.
|
10 years ago |
Ben Kurtovic
|
a68bebc43c
|
Tweak git version form a bit.
|
10 years ago |
Ben Kurtovic
|
96631e25f4
|
Make lazy importing code thread-safe.
|
10 years ago |
Ben Kurtovic
|
cc7ac52a05
|
Fix query counting.
|
10 years ago |
Ben Kurtovic
|
d672e670fa
|
Fix param name.
|
10 years ago |
Ben Kurtovic
|
0e28f89466
|
Update logging.
|
10 years ago |
Ben Kurtovic
|
ae0c390ceb
|
Redesign copyvio internals to parallelize URL loading/parsing.
|
10 years ago |
Ben Kurtovic
|
c38fe56ad4
|
Fix edit counter link.
|
10 years ago |
Ben Kurtovic
|
3a8349e8ab
|
Allow regexes in exclusion lists.
|
10 years ago |
Ben Kurtovic
|
3e4dac967d
|
Remove auto-quotes from queries; add min_query; halve max_query.
|
10 years ago |
Ben Kurtovic
|
1501341000
|
Allow even more time for a URL to time out.
|
10 years ago |
Ben Kurtovic
|
6b146a397a
|
Also strip out files and categories in ATP.strip().
|
10 years ago |
Ben Kurtovic
|
ccb3c022ca
|
Some servers don't leave a space before the content type parameter list.
|
10 years ago |
Ben Kurtovic
|
5e9d4cfa78
|
copyvios: use a different timeout for direct URL comparisons.
|
10 years ago |