Ben Kurtovic
86a8440730
Moving parsers to own file.
пре 12 година
Ben Kurtovic
d4e947b98b
earwigbot.wiki.copyvios.search module split
пре 12 година
Ben Kurtovic
e6a381f3f7
Restructuring copyvio stuff as its own package.
пре 12 година
Ben Kurtovic
9434a416a1
Moved search engine/credential info into config proper.
- In config.json, search config relocated from
tasks.afc_copyvios to wiki.
- Site.__init__() takes a `search_config' argument, which is
auto-supplied from its value in config.json by get_site().
- Page.copyvio_check() doesn't ask for search config
anymore, meaning doing checks from the command line
is less painful.
- Added a Page.copyvio_compare() function, which works
just like copyvio_check() but on a specified URL; this is
for cache retrieval on the web front-end.
пре 12 година
Ben Kurtovic
7cc85f9bc4
afc_copyvios: optionally cache results for the Toolserver.
пре 12 година
Ben Kurtovic
f382ceb38e
Pushing some smarter logic for MarkovChains
- Incomplete; need this for the TS rewrite
- Also starting work on docstrings for some methods
пре 12 година
Ben Kurtovic
7e6f1e8128
soxred93 -> tparis; toolserver account expired
пре 12 година
Ben Kurtovic
755dff9714
Copyvios: auto-fail very small articles (< 20 chain links)
пре 12 година
Ben Kurtovic
6009c050f9
Minor integer division fix.
пре 12 година
Ben Kurtovic
df7868da3e
Updates to copyright violation stuff.
пре 12 година
Ben Kurtovic
ee2b1133bb
Algorithm for comparing article content against a suspected source using MarkovChains
пре 13 година
Ben Kurtovic
2da906109b
Copyright update for 2012.
пре 13 година
Ben Kurtovic
13100533b9
CopyrightMixin needs Page._site
пре 13 година
Ben Kurtovic
c48073515b
#wikipedia-en-afc -> #wikipedia-en-afc-feed
пре 13 година
Ben Kurtovic
24f7eabb77
Some more work on copyvio detection code
Also removed the hardcoded version in user-agent strings.
пре 13 година
Ben Kurtovic
56e6140284
More work on copyright violation detection code.
пре 13 година
Ben Kurtovic
0b6d5eac5e
Some code for copyvio detection, including querying Yahoo! BOSS correctly.
пре 13 година
Ben Kurtovic
42081ab1c7
Reorder formatting so we don't destroy our datetime object before we're done with it.
пре 13 година
Ben Kurtovic
53533e1821
Support for a hidden date sortkey.
Also fixing a bug in get_notes() with regards to the 'old' note/warning.
пре 13 година
Ben Kurtovic
15748c6edc
Fixing unit test code.
пре 13 година
Ben Kurtovic
27867763eb
Disabling auto-linker.
пре 13 година
Ben Kurtovic
bff00f9b28
Restruturing codebase to be a bit more Pythonic.
пре 13 година
Ben Kurtovic
dbe9b57153
Sleep a bit more logically, adjust color.
пре 13 година
Ben Kurtovic
8f2b82b254
Wrong axis!
пре 13 година
Ben Kurtovic
3c343a175b
Looks like the axis kwarg was added to Axes matlab>1.0.1
пре 13 година
Ben Kurtovic
3858692247
Fix.
пре 13 година
Ben Kurtovic
63a30d2247
Prettify chart a bit.
пре 13 година
Ben Kurtovic
bb6e8f1063
Fix bottom kwarg for p3.
пре 13 година
Ben Kurtovic
b313412905
Fixes/improvements.
пре 13 година
Ben Kurtovic
2f86fe5b07
Fix data reversal.
пре 13 година
Ben Kurtovic
85af03df25
self.dest -> self.destination
пре 13 година
Ben Kurtovic
1701016a7d
Graphing with matplotlib.
пре 13 година
Ben Kurtovic
149fe8fdb4
Fix.
пре 13 година
Ben Kurtovic
4eca6a83e7
Forgot to int() custom num_days; fix generate().
пре 13 година
Ben Kurtovic
522e44abc4
Oops.
пре 13 година
Ben Kurtovic
ed8751970f
Fix update_date() when a submission is in multiple date categories.
пре 13 година
Ben Kurtovic
5fc281336a
Fix query in get_status().
пре 13 година
Ben Kurtovic
656d904515
Sleep a bit in between API queries.
пре 13 година
Ben Kurtovic
396a21c4a8
CHART_ -> STATUS_
пре 13 година
Ben Kurtovic
6bb438b60b
Fixes.
пре 13 година
Ben Kurtovic
13476460c9
Replag command; fix.
пре 13 година
Ben Kurtovic
a9f07c2611
Forgot import config >_>
пре 13 година
Ben Kurtovic
d949269944
Some SQL updates, starting work on afc_history task.
* get() -> return a Task instance by name (tasks)
* Using SQL to save API queries. (commands.{afc_report,afc_status})
* ignore_list -> ignoreList in config. (tasks.afc_statistics)
пре 13 година
Ben Kurtovic
3e5801d5e6
!status nocolor for HallowsAG
пре 13 година
Ben Kurtovic
5088315db4
Differentiate between completely blank pages and nonexistant or unretrievable pages.
пре 13 година
Ben Kurtovic
609b1cce75
Misplaced submissions can't be 'resumbmitted'; page_special_user is only significant for CHART_PEND and CHART_DRAFT.
пре 13 година
Ben Kurtovic
7757af58fd
Fix; get_size() rounds to tenths instead of hundreds.
пре 13 година
Ben Kurtovic
1c850764d7
Fixes in Unicode, status determination; removed created.
пре 13 година
Ben Kurtovic
9ec03520b3
Fix.
пре 13 година
Ben Kurtovic
e02ff2d5db
Fix get_status_and_chart(content, namespace)
пре 13 година