Ben Kurtovic
1bf546b706
Apparently punctuation marks can be used as headers too.
12 anos atrás
Ben Kurtovic
d72be12d41
Make less broken.
12 anos atrás
Ben Kurtovic
b6ddc84561
rawr
12 anos atrás
Ben Kurtovic
7cda82546a
More sophisticated regex for parsing links.
12 anos atrás
Ben Kurtovic
be85dbb72d
Perhaps I should just give up.
12 anos atrás
Ben Kurtovic
410c7b3642
Oh god this is pathetic.
12 anos atrás
Ben Kurtovic
2ceddc2e64
I think this works.
12 anos atrás
Ben Kurtovic
1ca93674e5
Spelling error!
12 anos atrás
Ben Kurtovic
e5b857d20c
Er, fix that.
12 anos atrás
Ben Kurtovic
f5dc31f1da
More support for section headers in !dictionary.
12 anos atrás
Ben Kurtovic
81313e7a79
Copy search_config too.
12 anos atrás
Ben Kurtovic
e53efe0e84
Actually, create a copy of the dict.
12 anos atrás
Ben Kurtovic
53c852e08b
Minor fix if sql's values are not strings.
12 anos atrás
Ben Kurtovic
fb85469261
Typo
12 anos atrás
Ben Kurtovic
d338dc15c5
Clean up command more ( #30 )
12 anos atrás
Ben Kurtovic
97d50e00fc
Fix syntax error ( #30 )
12 anos atrás
Ben Kurtovic
993f9a5f98
!log command ( #30 )
12 anos atrás
Ben Kurtovic
d321cd8a14
Handle all of the 'x of y' as a single case.
12 anos atrás
Ben Kurtovic
ba97e10af9
Fix that a bit. I give up on this command now.
12 anos atrás
Ben Kurtovic
d8d0bcf5fa
Some smarter parsing, plus given names and surnames.
12 anos atrás
Ben Kurtovic
f4c6552778
Support for some more complex parsing. Might break stuff.
12 anos atrás
Ben Kurtovic
a630138679
Synonyms.
12 anos atrás
Ben Kurtovic
2252a34800
Don't allow infinite retries.
12 anos atrás
Ben Kurtovic
b3f0c2dba6
Some more fixes.
12 anos atrás
Ben Kurtovic
6250e813e3
abbreviation
12 anos atrás
Ben Kurtovic
1ca93b4a04
Proper nouns!
12 anos atrás
Ben Kurtovic
fd2348ed1a
Unicode fix for !dict and some stuff.
12 anos atrás
Ben Kurtovic
f1e0a6f4de
Merge branch 'feature/dictionary' into develop
12 anos atrás
Ben Kurtovic
fc563f4ddd
Finish !dictionary command ( #31 ).
12 anos atrás
Ben Kurtovic
fb31aa73c8
Proper handling of unicode in some commands.
12 anos atrás
Ben Kurtovic
3cfedde6bd
A bunch of cleanup and fixes.
12 anos atrás
Ben Kurtovic
e63cd89ed5
Starting !dictionary command ( #31 )
12 anos atrás
Ben Kurtovic
4944c120bd
Merge branch 'feature/copyvios' into develop
OH MY GOD I'M FINALLY DONE.
12 anos atrás
Ben Kurtovic
b42389d393
Substitute \x0301 with \x0F for returning to "normal" colors.
12 anos atrás
Ben Kurtovic
b891f2f6f4
Try \x0F
12 anos atrás
Ben Kurtovic
6032ff958f
Testing a neutral color instead of black.
12 anos atrás
Ben Kurtovic
becd135c52
Minor cleanup for afc_copyvios, mainly Unicode fixes.
12 anos atrás
Ben Kurtovic
439b855254
Fully implement logging; fix non-unicode log messages.
12 anos atrás
Ben Kurtovic
d07f0b5f9a
Add loggers to Category, Page, and User.
12 anos atrás
Ben Kurtovic
1c2dcc999a
__repr__ and __str__ for ExclusionsDB ( #5 ).
12 anos atrás
Ben Kurtovic
a074da853b
More work on copyvios, including an exclusions database ( #5 )
* Added exclusions module with a fully implemented ExclusionsDB that can pull
from multiple sources for different sites.
* Moved CopyvioCheckResult to its own module, to be imported by __init__.
* Some other related changes.
12 anos atrás
Ben Kurtovic
3744a34f28
Allow templated SQL connection info.
12 anos atrás
Ben Kurtovic
c260648bdb
Finish chunking algorithm, improve !link, other fixes.
12 anos atrás
Ben Kurtovic
569c815d99
Implement NLTK for chunking article content ( #5 ).
12 anos atrás
Ben Kurtovic
35eb23046f
Merge pull request #29 from Emalee/develop
Make trout command change target yourself to himself
12 anos atrás
Emalee
ad1ff720b4
apparently the one line if statement is really just a ternary operator?
12 anos atrás
Emalee
377cce1703
make trout command change target yourself to himself
12 anos atrás
Ben Kurtovic
17eee28a4b
Whoops, got the slicing wrong.
12 anos atrás
Ben Kurtovic
bf1ad08dc6
Make Markov chain degree-independent. Testing trigrams.
12 anos atrás
Ben Kurtovic
cb87004107
Primitive screen scraper for HTML using BeautifulSoup and LXML.
Obviously this can and should be improved significantly later, but it seems
good enough for now.
12 anos atrás