Ben Kurtovic
86a8440730
Moving parsers to own file.
12 anni fa
Ben Kurtovic
d4e947b98b
earwigbot.wiki.copyvios.search module split
12 anni fa
Ben Kurtovic
e6a381f3f7
Restructuring copyvio stuff as its own package.
12 anni fa
Ben Kurtovic
9434a416a1
Moved search engine/credential info into config proper.
- In config.json, search config relocated from
tasks.afc_copyvios to wiki.
- Site.__init__() takes a `search_config' argument, which is
auto-supplied from its value in config.json by get_site().
- Page.copyvio_check() doesn't ask for search config
anymore, meaning doing checks from the command line
is less painful.
- Added a Page.copyvio_compare() function, which works
just like copyvio_check() but on a specified URL; this is
for cache retrieval on the web front-end.
12 anni fa
Ben Kurtovic
7cc85f9bc4
afc_copyvios: optionally cache results for the Toolserver.
12 anni fa
Ben Kurtovic
f382ceb38e
Pushing some smarter logic for MarkovChains
- Incomplete; need this for the TS rewrite
- Also starting work on docstrings for some methods
12 anni fa
Ben Kurtovic
7e6f1e8128
soxred93 -> tparis; toolserver account expired
12 anni fa
Ben Kurtovic
755dff9714
Copyvios: auto-fail very small articles (< 20 chain links)
12 anni fa
Ben Kurtovic
6009c050f9
Minor integer division fix.
12 anni fa
Ben Kurtovic
df7868da3e
Updates to copyright violation stuff.
12 anni fa
Ben Kurtovic
ee2b1133bb
Algorithm for comparing article content against a suspected source using MarkovChains
13 anni fa
Ben Kurtovic
2da906109b
Copyright update for 2012.
13 anni fa
Ben Kurtovic
13100533b9
CopyrightMixin needs Page._site
13 anni fa
Ben Kurtovic
c48073515b
#wikipedia-en-afc -> #wikipedia-en-afc-feed
13 anni fa
Ben Kurtovic
24f7eabb77
Some more work on copyvio detection code
Also removed the hardcoded version in user-agent strings.
13 anni fa
Ben Kurtovic
56e6140284
More work on copyright violation detection code.
13 anni fa
Ben Kurtovic
0b6d5eac5e
Some code for copyvio detection, including querying Yahoo! BOSS correctly.
13 anni fa
Ben Kurtovic
42081ab1c7
Reorder formatting so we don't destroy our datetime object before we're done with it.
13 anni fa
Ben Kurtovic
53533e1821
Support for a hidden date sortkey.
Also fixing a bug in get_notes() with regards to the 'old' note/warning.
13 anni fa
Ben Kurtovic
15748c6edc
Fixing unit test code.
13 anni fa
Ben Kurtovic
27867763eb
Disabling auto-linker.
13 anni fa
Ben Kurtovic
bff00f9b28
Restruturing codebase to be a bit more Pythonic.
13 anni fa
Ben Kurtovic
dbe9b57153
Sleep a bit more logically, adjust color.
13 anni fa
Ben Kurtovic
8f2b82b254
Wrong axis!
13 anni fa
Ben Kurtovic
3c343a175b
Looks like the axis kwarg was added to Axes matlab>1.0.1
13 anni fa
Ben Kurtovic
3858692247
Fix.
13 anni fa
Ben Kurtovic
63a30d2247
Prettify chart a bit.
13 anni fa
Ben Kurtovic
bb6e8f1063
Fix bottom kwarg for p3.
13 anni fa
Ben Kurtovic
b313412905
Fixes/improvements.
13 anni fa
Ben Kurtovic
2f86fe5b07
Fix data reversal.
13 anni fa
Ben Kurtovic
85af03df25
self.dest -> self.destination
13 anni fa
Ben Kurtovic
1701016a7d
Graphing with matplotlib.
13 anni fa
Ben Kurtovic
149fe8fdb4
Fix.
13 anni fa
Ben Kurtovic
4eca6a83e7
Forgot to int() custom num_days; fix generate().
13 anni fa
Ben Kurtovic
522e44abc4
Oops.
13 anni fa
Ben Kurtovic
ed8751970f
Fix update_date() when a submission is in multiple date categories.
13 anni fa
Ben Kurtovic
5fc281336a
Fix query in get_status().
13 anni fa
Ben Kurtovic
656d904515
Sleep a bit in between API queries.
13 anni fa
Ben Kurtovic
396a21c4a8
CHART_ -> STATUS_
13 anni fa
Ben Kurtovic
6bb438b60b
Fixes.
13 anni fa
Ben Kurtovic
13476460c9
Replag command; fix.
13 anni fa
Ben Kurtovic
a9f07c2611
Forgot import config >_>
13 anni fa
Ben Kurtovic
d949269944
Some SQL updates, starting work on afc_history task.
* get() -> return a Task instance by name (tasks)
* Using SQL to save API queries. (commands.{afc_report,afc_status})
* ignore_list -> ignoreList in config. (tasks.afc_statistics)
13 anni fa
Ben Kurtovic
3e5801d5e6
!status nocolor for HallowsAG
13 anni fa
Ben Kurtovic
5088315db4
Differentiate between completely blank pages and nonexistant or unretrievable pages.
13 anni fa
Ben Kurtovic
609b1cce75
Misplaced submissions can't be 'resumbmitted'; page_special_user is only significant for CHART_PEND and CHART_DRAFT.
13 anni fa
Ben Kurtovic
7757af58fd
Fix; get_size() rounds to tenths instead of hundreds.
13 anni fa
Ben Kurtovic
1c850764d7
Fixes in Unicode, status determination; removed created.
13 anni fa
Ben Kurtovic
9ec03520b3
Fix.
13 anni fa
Ben Kurtovic
e02ff2d5db
Fix get_status_and_chart(content, namespace)
13 anni fa