mwparserfromhell

Commit Graph

Author	SHA1	Message	Date
David Ebbo	4acb885b9f	Increase MAX_DEPTH to 100 (#314 ) * Increase MAX_DEPTH to 100 * Update nested_templates_and_style_tags to cover new limit	10 months ago
Ben Kurtovic	af83306e04	Fix pickling SmartLists (fixes #289 )	1 year ago
Ben Kurtovic	f963df7f82	Fix parsing of leading zeros in named HTML entities (fixes #288 )	2 years ago
Ben Kurtovic	2155638b91	Fix regression in parsing nested wikilinks in file captions This regression seems more severe than the bug the commit was attempting to fix (incorrect parsing of nested wikilinks in normal links), so that bug is reintroduced until localization-aware parsing that allows us to detect file links is added. This commit partially reverts `fac60dee48`.	2 years ago
Ben Kurtovic	8cd0bdb322	Autoformat: black + clang-format + clang-tidy	3 years ago
Ben Kurtovic	fac60dee48	Fix parsing of nested wikilinks	3 years ago
Ben Kurtovic	297bcb0cee	Move mwparserfromhell to src/ dir	3 years ago
Ben Kurtovic	074e368684	Clean up pytest-ported tests	3 years ago
Jakub Klinkovský	bb51e8f282	Some fixes for the parsing of external links (#232 ) * Proposed fix for https://github.com/earwig/mwparserfromhell/issues/197 * Port the fix for #197 to the C tokenizer * Fix parsing of external links where the URL is terminated by some special character - One existing test case has been found wrong -- current MediaWiki version always terminates the URL when an opening bracket is encountered. - Other test cases added: double quote, two single quotes and angles always terminate the URL (regardless if it is a free link or external link inside brackets). One single quote does not terminate the URL. * Fix case-insensitive parsing of URI schemes	3 years ago
Jakub Klinkovský	90061b6844	Fix parsing of section headings inside templates (#233 ) Fixes #198 Co-authored-by: Ben Kurtovic <ben.kurtovic@gmail.com>	3 years ago
Ben Kurtovic	b7b3b2e33e	Update changelog; minor tweak to file headers	3 years ago
Ben Kurtovic	1c983d3738	Assorted cleanup, linter fixes, and improvements for Python 3	3 years ago
Kunal Mehta	03181bcd8b	Port tests to use pytest pytest is the preferred way to write and run unit tests these days and it has a cleaner interface - so lets switch to it. The tokenizer tests especially are much easier to read/understand. This was mostly done with find/replace regexes and then cleaned up manually.	4 years ago
Kunal Mehta	7e5297fbe6	Drop Python 2 support Fixes #221.	4 years ago
Yuri Astrakhan	aa37425a9b	move smart_list into sub-package/multiple files Step one of refactoring - making SmartList into its own package, with each class having its own file. No code changes were made. Note that SmartList and ListProxy import each other, so had to import SmartList as a full package name rather than use from ... import ... construct.	4 years ago
Ben Kurtovic	b09b619709	Switch to 'unittest discover' over 'setup.py test'	4 years ago
Ben Kurtovic	b3c98efd22	Fix a parsing bug involving deeply nested style tags (fixes #224 )	4 years ago
Ben Kurtovic	2a4e1f4316	Add contexts.describe() for debugging	4 years ago
Ben Kurtovic	b6e4c59004	Switch to requests for basic API example (closes #219 ); update links	5 years ago
Ben Kurtovic	6136b1b205	Make Wikicode.matches() treat _ and space as equivalent (fixes #216 )	5 years ago
Ben Kurtovic	4775131717	Fix not memoizing bad routes after failing inside a table (fixes #206 )	5 years ago
Ben Kurtovic	6e61c99c90	Update API query example; clarify docstring	5 years ago
Ben Kurtovic	83bcb902b8	Support manual construction of Node objects (fixes #214 )	5 years ago
Ben Kurtovic	0ae5f6d641	Fix regression in previous commit on _ListProxy transformations (fixes #213 )	5 years ago
Ben Kurtovic	840a88bcd6	Fix Wikicode transformation methods on empty sections (fixes #212 )	5 years ago
Ben Kurtovic	de6e671c40	Update changelog, remove now-unneeded test discovery script, cleanup	6 years ago
Hugo	f372d3d495	Upgrade unit test asserts	6 years ago
Hugo	e457b39f32	Upgrade Python syntax with pyupgrade https://github.com/asottile/pyupgrade	6 years ago
Hugo	59636609db	Drop support for EOL Python	6 years ago
Ben Kurtovic	46000ee7c8	Fix test on old Python versions	7 years ago
Ben Kurtovic	253102be35	Minor change to template test_formatting format.	7 years ago
Ben Kurtovic	7a30e47f76	Some improvements to whitespace recognition; unit tests (#185 ).	7 years ago
Ben Kurtovic	cd4f90e663	Fix a rare parsing bug involving nested broken tags.	7 years ago
Ben Kurtovic	8a9c9224be	Speed up parsing deeply nested syntax by caching bad routes (fixes #42 ) Also removed the max cycles stop-gap, allowing much more complex pages to be parsed quickly without losing nodes at the end Also fixes #65, fixes #102, fixes #165, fixes #183 Also fixes #81 (Rafael Nadal parsing bug) Also fixes #53, fixes #58, fixes #88, fixes #152 (duplicate issues)	7 years ago
Ben Kurtovic	d7c755f526	Add Wikicode.contains(), Wikicode.get_ancestors(), Wikicode.get_parent() (#177 )	7 years ago
Ben Kurtovic	68ded2f890	Add keep_template_params to Wikicode.strip_code (#175 )	7 years ago
Ben Kurtovic	6159171e04	Make Template.remove(keep_field=True) slightly more reasonable.	7 years ago
Ben Kurtovic	f34f662f35	Fix len() sometimes raising ValueError on empty node lists (fixes #174 )	7 years ago
Ben Kurtovic	aaffb7f66b	Update copyright for 2016.	8 years ago
Ben Kurtovic	4707b455b5	Add a new test to check for parsing bug; fix an existing test (#142 )	8 years ago
Ben Kurtovic	61b6b98470	Fix two parser bugs involving wikitable error handling.	9 years ago
Ben Kurtovic	50b401549b	Add failing test cases for #125 .	9 years ago
Ben Kurtovic	4f3ab48375	Edge cases involving wikilink -> external link fallback (fixes #120 )	9 years ago
Ben Kurtovic	67214b7c05	Add some failing tests for SmartList features.	9 years ago
Ben Kurtovic	ab9f6a97fb	Use weakrefs for SmartList children; remove _ListProxy.detach().	9 years ago
Ben Kurtovic	2a3a978986	Incomplete code for C tokenizer textbuffer.	9 years ago
Ben Kurtovic	f16c7e25ca	Fully fix parsing templates with blank names, I hope (#111 )	9 years ago
Ben Kurtovic	7993224926	Add a failing test for a missed component of #59	9 years ago
Ben Kurtovic	56f1797cfe	Add failing tests for #111	9 years ago
Ben Kurtovic	46cb714344	Fix unit tests for `699d063` (#109 )	9 years ago

1 2 3 4 5 ...

290 Commits (v0.6.6)