113b87c8c8
count div as newline
2026-04-15 09:22:00 +00:00
alex wennerberg
97a1194cb1
Fix whitespace bug
2026-02-14 13:29:52 -08:00
alex wennerberg
8d57ec3524
Fix entity bug
2024-10-08 10:27:18 -04:00
alex wennerberg
cc8b9b0210
A ton of changes
...
Someone emailed me a patch with a lot of improvements here
2024-10-03 21:27:45 -04:00
Ayrat Badykov
05fc9a55c8
do not add content if href == content
2022-09-24 15:43:54 +03:00
Ayrat Badykov
600ebbe445
parse href along with its content
2022-09-24 14:05:25 +03:00
Ayrat Badykov
10c2a0cbff
fix invalid parsing of invalid html entities
2022-04-23 09:12:55 -07:00
alex wennerberg
1624bd13e6
Remove comment -- this is no longer a line for line rewrite
2022-01-12 18:08:30 -08:00
Johann150
1f678a8621
refactor tests to have meaningful names
2022-01-13 02:05:55 +01:00
Johann150
db479b931e
improve API documentation
2022-01-13 02:05:35 +01:00
Johann150
0b0d350243
refactor core parser
...
Fix some bugs in previously refactored functions.
Change the general processing idea to be "tag oriented" instead of
naively scanning the whole HTML string.
Collapse the cases for different tags that are handled the same.
2022-01-13 02:05:12 +01:00
Johann150
a18add05fb
rewrite is_bad_tag with matches! macro
...
Removing String allocation in iterator.
2022-01-13 00:12:02 +01:00
Johann150
00a9b04d90
make several functions more concise
...
Make more use of try operator and standard library functions.
Do not operate on bytes directly.
2022-01-13 00:11:49 +01:00
Johann150
6e3fd37e73
clean up decode_named_entity
...
Update the code style to current Rust.
Add the proper license notice to the entity.rs file.
2022-01-12 22:19:57 +01:00
Johann150
f5f435a8c5
format entities list with cargo fmt
2022-01-12 21:50:27 +01:00
alex wennerberg
dbe0b94cf0
compiler warning
2022-01-06 10:52:07 -08:00
alex wennerberg
d3806f8c0d
Allow unicode URLs
2022-01-06 10:46:10 -08:00
alex wennerberg
49be4908ba
I think this is a bug fix
2022-01-06 10:13:41 -08:00
alex wennerberg
9ed16fa569
move src/main to src/lib, fix bugs
2022-01-06 09:59:52 -08:00
alex wennerberg
4ac6a10738
bad tag bug fix
2022-01-06 09:36:38 -08:00
alex wennerberg
72f8d61867
fix html entity parsing
2022-01-06 08:54:59 -08:00
alex wennerberg
39d3f5d6a5
Add link support
2022-01-06 08:39:51 -08:00
alex wennerberg
0f96420abb
Extend and add unit tests
2022-01-05 10:12:49 -08:00
alex wennerberg
5e2fb2cc82
Add numeric entity parsing
2022-01-04 07:54:58 -08:00
alex wennerberg
c6368a4906
add entity encoding/decoding
2022-01-03 12:58:30 -08:00
alex wennerberg
b65548e2ca
rewriting more code
2022-01-03 09:57:11 -08:00
alex wennerberg
8ad91b23d6
Update README, get started
2022-01-03 08:57:38 -08:00
alex wennerberg
06549d688c
Initial commit
2021-12-19 08:55:34 -08:00