logo

searx

My custom branche(s) on searx, a meta-search engine
commit: 699fe60e5072eeffebfb2f3b9b85cefe6dd1b7bc
parent: 06186e72a9cb9af89193880b41394cc9be5c5b35
Author: Adam Tauber <asciimoo@gmail.com>
Date:   Thu, 22 Jan 2015 17:43:45 +0100

[fix] handle single closing element in HTMLTextExtractor

Diffstat:

Msearx/utils.py4++++
1 file changed, 4 insertions(+), 0 deletions(-)

diff --git a/searx/utils.py b/searx/utils.py @@ -82,8 +82,12 @@ class HTMLTextExtractor(HTMLParser): self.tags.append(tag) def handle_endtag(self, tag): + if not self.tags: + return + if tag != self.tags[-1]: raise Exception("invalid html") + self.tags.pop() def is_valid_tag(self):