Ben Kurtovic
a1948b06aa
Tokenizer_parse_bracketed/free_uri_scheme(), other adjustments
10 jaren geleden
Ben Kurtovic
7b84b3f0df
Refactor out C's is_marker(); hooks for ext links.
10 jaren geleden
Ben Kurtovic
d42e05a554
Implement improved wikilink handling.
10 jaren geleden
Ben Kurtovic
5e6e5b6301
tag_defs.py -> definitions.py; more outline stuff
10 jaren geleden
Ben Kurtovic
cbf67c7842
Add hooks for some ext link stuff; add a INVALID_LINK aggregate context.
10 jaren geleden
Ben Kurtovic
8923d96a57
More unification.
10 jaren geleden
Ben Kurtovic
b5ec7f3beb
Fix py3k module importing; stick a bunch of macros in one place.
10 jaren geleden
Ben Kurtovic
25d53cacf8
Begin porting C tokenizer to Python 3.
10 jaren geleden
Ben Kurtovic
ebf99d722c
Combine emit()/emit_first() internally.
10 jaren geleden
Ben Kurtovic
51ac97de04
Make macros out of the failing/unsafe contexts.
10 jaren geleden
Ben Kurtovic
df9f7388b6
emit_FAST(), emit_first_FAST(); update comment parsing
10 jaren geleden
Ben Kurtovic
c20d3f2a6a
handle_list_marker() and handle_list()
10 jaren geleden
Ben Kurtovic
9b98907751
Add C hooks and prototypes for wiki-markup tags.
10 jaren geleden
Ben Kurtovic
4663563ce4
Remove unnecessary markers.
10 jaren geleden
Ben Kurtovic
e83f321340
Rearrange functions; remove useless prototypes.
10 jaren geleden
Ben Kurtovic
e3fc27c9e3
Refactor TagData code into dedicated functions.
11 jaren geleden
Ben Kurtovic
d02a6da81e
Implement Tokenizer_handle_tag_space(); refactor textbuffer writing.
- Add a test for very long strings of text.
11 jaren geleden
Ben Kurtovic
9365fcf6e4
Implement Tokenizer_handle_tag_data(); add a read-backwards macro.
11 jaren geleden
Ben Kurtovic
e636bf77cf
Implement Tokenizer_push_tag_buffer()
11 jaren geleden
Ben Kurtovic
653071379b
Finish porting misc changes; add prototypes for remaining functions.
11 jaren geleden
Ben Kurtovic
aca0f78cd7
Port more Python tokenizer updates to C.
11 jaren geleden
Ben Kurtovic
f67cf46900
Start C port of tag tokenization; refactor the init func.
11 jaren geleden
Ben
a689467577
Replace broken log2 function; add a missing comment.
11 jaren geleden
Ben Kurtovic
9ede1121ba
Fix tokenizer.c on Windows; add another template test ( #25 )
Mostly by @gdooms , with tweaks.
11 jaren geleden
Ben Kurtovic
debcb6577e
Fix recursion issues by giving up at a certain point ( closes #16 ).
- Stop parsing new templates if the template depth gets above
MAX_DEPTH (40) or if we've already tried to parse over MAX_CYCLES
(100,000) templates.
- Add two tests to ensure recursion works somewhat correctly.
- Fix parsing the string "{{" with the Python tokenizer; add a test.
11 jaren geleden
Ben Kurtovic
054a84afe0
A bit of misc cleanup.
11 jaren geleden
Ben Kurtovic
718fcb24c8
Fix eight failing tests; all template parsing tests now passing ( #25 ).
11 jaren geleden
Ben Kurtovic
5a0a00ba98
Change the way verify_safe() handles template params ( #25 ).
- Newlines are now allowed in template param names.
- Changes also affect handling of arguments like {{{foo}}}.
- Update unit tests: remove some unnecessary ones, and add some to cover the changes.
- Update StringMixIn tests to actually work for some of the methods.
- Update copyright notices for the C extensions.
11 jaren geleden
Ben Kurtovic
0803417901
Port CTokenizer's verify_safe method to Python to solve a failing test.
11 jaren geleden
Ben Kurtovic
d6f2723a06
Fix safety checks on template params in some odd cases ( closes #24 ).
Also, fix parsing of wikilinks in both tokenizers such that newlines
in any location within the title are an automatic failure.
11 jaren geleden
Ben Kurtovic
1f47e10f04
Start some Py3k compatibility stuff.
11 jaren geleden
Ben Kurtovic
4a725b7ac5
Fix another couple bugs regarding template contexts and verify_safe().
11 jaren geleden
Ben Kurtovic
06b20dd8c0
More updates.
11 jaren geleden
Ben Kurtovic
16024494cf
Mostly implement Tokenizer_really_parse_entity().
11 jaren geleden
Ben Kurtovic
7de34ed5a6
Rewrite verify_safe completely.
11 jaren geleden
Ben Kurtovic
cdef073a9b
Replace textbuffer with our own datastructure.
11 jaren geleden
Ben Kurtovic
f0a36f3262
Store tokens in a global variable instead of retrieving anew each time.
11 jaren geleden
Ben Kurtovic
6d73eeeab1
Replace Python list of stacks with a singly linked list.
11 jaren geleden
Ben Kurtovic
ff0b4439f8
Replace Python exception machinery with a global and some macros.
11 jaren geleden
Ben Kurtovic
8e4819ea37
Fix exception implementation; some other cleanup and fixes.
11 jaren geleden
Ben Kurtovic
9993f1ba76
Fix object calls.
11 jaren geleden
Ben Kurtovic
0be18bc5b8
Fix Py_UNICODE comparisons.
11 jaren geleden
Ben Kurtovic
17a09e395a
Fix some bugs.
11 jaren geleden
Ben Kurtovic
48188bfa99
Implement Tokenizer_parse_heading(), some other stuff.
11 jaren geleden
Ben Kurtovic
849016f734
Implement Tokenizer_verify_safe() and some others.
11 jaren geleden
Ben Kurtovic
0d720a7ef1
Implement Tokenizer_parse_template(); NOARGS and NOKWARGS.
11 jaren geleden
Ben Kurtovic
6edc24037e
Implement Tokenizer_parse_template_or_argument().
11 jaren geleden
Ben Kurtovic
7fc45783b7
Add a header file; improve context handling.
11 jaren geleden