searx

My custom branch(es) on searx, a meta-search engine
git clone https://hacktivis.me/git/searx.git
commit: 5761d6f0ab071bdae05ecef1966dd3e4cbec6eee
parent efde2c21c8656ad21b24980b516ddbbf2e209523
Author: Cqoicebordel <Cqoicebordel@users.noreply.github.com>
Date:   Thu, 29 Jan 2015 21:19:59 +0100

Bing news engine corrections
XPath *never* returns None.

(I found the HTML report of coverage)

Diffstat:

M searx/engines/bing_news.py | 6 ++----
1 file changed, 2 insertions(+), 4 deletions(-)

diff --git a/searx/engines/bing_news.py b/searx/engines/bing_news.py
@@ -59,16 +59,14 @@ def response(resp):
         url = link.attrib.get('href')
         title = extract_text(link)
         contentXPath = result.xpath('.//div[@class="sn_txt"]/div//span[@class="sn_snip"]')
-        if contentXPath is not None:
-            content = escape(extract_text(contentXPath))
+        content = escape(extract_text(contentXPath))
 
         # parse publishedDate
         publishedDateXPath = result.xpath('.//div[@class="sn_txt"]/div'
                                           '//span[contains(@class,"sn_ST")]'
                                           '//span[contains(@class,"sn_tm")]')
 
-        if publishedDateXPath is not None:
-            publishedDate = escape(extract_text(publishedDateXPath))
+        publishedDate = escape(extract_text(publishedDateXPath))
 
         if re.match("^[0-9]+ minute(s|) ago$", publishedDate):
             timeNumbers = re.findall(r'\d+', publishedDate)
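
The commit message's point, that an element query yields an empty list rather than None when nothing matches, can be illustrated with a small sketch. This is not code from searx. It assumes the standard library's xml.etree.ElementTree and its findall() as a stand-in for lxml's xpath(), which behaves the same way for element queries, so that it runs without third-party packages. The sample markup is hypothetical and only borrows class names from the diff.

```python
import xml.etree.ElementTree as ET

# Hypothetical markup, loosely modelled on the class names in the diff.
SAMPLE = """
<div class="sn_r">
  <div class="sn_txt"><div><span class="sn_snip">Some snippet</span></div></div>
</div>
"""

result = ET.fromstring(SAMPLE)

# One query that matches, and one that does not (the sample has no "sn_tm" span).
found = result.findall('.//span[@class="sn_snip"]')
missing = result.findall('.//span[@class="sn_tm"]')

# The non-matching query yields an empty list, not None ...
assert isinstance(missing, list) and missing == []

# ... so an `is not None` guard passes even when nothing matched,
guard_passes = missing is not None

# whereas a truthiness check does tell the two cases apart.
has_match = bool(missing)

print(len(found), missing, guard_passes, has_match)  # prints: 1 [] True False
```

Since the guard is always true, removing it as the diff does should leave behaviour unchanged. A check such as `if publishedDateXPath:` would be one way to detect a missing element, if that distinction were ever needed.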