Ben Kurtovic
929fa25e1f
Correctly tokenize single-only HTML tags, part two.
11 lat temu
Ben Kurtovic
b24ddaea10
Tokenizer support for implicitly self-closing tags.
11 lat temu
Ben Kurtovic
d3a4078880
Tests for single and single_only tags.
11 lat temu
Ben Kurtovic
50beda0914
Improve/fix the way padding is handled.
11 lat temu
Ben Kurtovic
2a71c18892
Four tests for <nowiki> + friends.
11 lat temu
Ben Kurtovic
e5ada4753a
Adding a test for triple backslashes.
11 lat temu
Ben Kurtovic
ffb554c36f
Fix broken tests.
11 lat temu
Ben Kurtovic
4c7ed9c6a9
Add seven tests related to backslashes before quotes.
11 lat temu
Ben Kurtovic
17c71e335f
Add three tests for invalid attribute quote usage.
11 lat temu
Ben Kurtovic
dd6bb1637d
Support tag nesting properly; unit tests; recursion checks for tags.
11 lat temu
Ben Kurtovic
f63480bcf3
Update the integration.rich_tags test to use the new tag tokens.
Remove an now-unused import in the tokenizer.
11 lat temu
Ben Kurtovic
43e717cca9
Add a number of new tag tests.
11 lat temu
Ben Kurtovic
81e8fdd682
Give Attributes more attributes for padding data.
11 lat temu
Ben Kurtovic
ce27d5d385
Fix six failing tests; add three more (all passing).
11 lat temu
Ben Kurtovic
6450814729
Remove 'type' attribute from tags; rework tag definitions.
11 lat temu
Ben Kurtovic
03e41286c6
Add a number of tag tests. A couple of these are failing.
11 lat temu
Ben Kurtovic
9ea06c2830
Push the textbuffer to fix a couple broken tests.
11 lat temu
Ben Kurtovic
1b4c01b4c0
Implement assertTagNodeEqual(), start test_tag(), add to tags.mwtest.
11 lat temu
Ben Kurtovic
ec6ba67dbe
Tokenizer integration tests; finish unit test coverage ( closes #7 )
11 lat temu
Ben Kurtovic
6e2ca06ebe
Tokenizer tests for HTML entities.
11 lat temu
Ben Kurtovic
6070eff657
Tokenizer tests for headings. Also, add an integration test.
11 lat temu
Ben Kurtovic
dd199c4389
Fix tokenizer tests with setup.py test.
11 lat temu
Ben Kurtovic
9e527146c7
Tokenizer tests for comments.
11 lat temu
Ben Kurtovic
ebaa0e60b6
Tokenizer tests for wikilinks.
Also add one for arguments.
11 lat temu
Ben Kurtovic
db06eda8c0
Tokenizer tests for arguments.
Also add a couple for templates and one for integration.
11 lat temu
Ben Kurtovic
a0fb8361eb
Can specify specific tokenizer tests via the command line; add files
11 lat temu
Ben Kurtovic
9ede1121ba
Fix tokenizer.c on Windows; add another template test ( #25 )
Mostly by @gdooms , with tweaks.
11 lat temu
Ben Kurtovic
debcb6577e
Fix recursion issues by giving up at a certain point ( closes #16 ).
- Stop parsing new templates if the template depth gets above
MAX_DEPTH (40) or if we've already tried to parse over MAX_CYCLES
(100,000) templates.
- Add two tests to ensure recursion works somewhat correctly.
- Fix parsing the string "{{" with the Python tokenizer; add a test.
11 lat temu
Ben Kurtovic
5a0a00ba98
Change the way verify_safe() handles template params ( #25 ).
- Newlines are now allowed in template param names.
- Changes also affect handling of arguments like {{{foo}}}.
- Update unit tests: remove some unnecessary ones, and add some to cover the changes.
- Update StringMixIn tests to actually work for some of the methods.
- Update copyright notices for the C extensions.
11 lat temu
Ben Kurtovic
31a977bdfe
Finish all incomplete template tests.
11 lat temu
Ben Kurtovic
24c55aeeb1
Adding a syntax highlighter for the test-case format.
11 lat temu
Ben Kurtovic
556477f801
Adding a bunch more tests.
11 lat temu
Ben Kurtovic
660a0c31e6
Adding a bunch more tests.
11 lat temu
Ben Kurtovic
d500f8972e
Add a few more tests; use assert*(expected, actual) instead of opposite.
11 lat temu
Ben Kurtovic
713b83a4d9
Added a metric ton of template tests; adjustments; docstrings.
11 lat temu
Ben Kurtovic
eb1bd6b281
Add some basic tests for templates; adjust error messages again.
11 lat temu
Ben Kurtovic
ecfb2c628f
Another test; handle errors when reading output line better.
11 lat temu
Ben Kurtovic
357b6dc447
Make unit tests work in Python 3; add a unicode text test.
11 lat temu
Ben Kurtovic
4636fbeb4a
Built an infrastructure for loading and running tokenizer tests.
11 lat temu