Commit Graph

38 Commits

Author SHA1 Message Date
7d4d1d7b67 Parse language in readability 2022-05-26 10:55:21 +08:00
2ee95a1c14 Fix cannot convert null to object error message 2022-05-20 20:38:30 +08:00
eadeccce81 Linting fixes 2022-05-18 11:31:42 -07:00
004c766588 If parsing fails, attempt adding <html> wrappers to a document
LinkedDom seems less forgiving and expects the outerHTML of a
document, however older extension versions still send innerHTML.
2022-05-18 10:55:31 -07:00
d68549bcb7 Remove unused code 2022-05-18 15:52:31 +08:00
e76fb02f43 Fix window is not defined for parsing code blocks 2022-05-17 11:01:55 +08:00
1b8850ed33 Fix tests 2022-05-12 17:41:11 +08:00
602d141dec Rename doc to dom 2022-05-12 11:00:32 +08:00
a78a6c6ba4 Replace DomWindow with Document in handlers 2022-05-10 17:01:23 +08:00
6a57281e74 Remove DomWindow usage 2022-05-10 17:00:56 +08:00
acc7654a2f Replace jsdom with linkedom 2022-05-10 16:59:09 +08:00
5698790288 Pass url to readability 2022-05-10 16:53:45 +08:00
59a2639b7d Reduce http call to get jsonld data if title or content or sitename or byline exists 2022-05-09 13:45:45 +08:00
a457c9d128 Update article content only when code blocks exist 2022-05-09 13:45:45 +08:00
eaad96acdd Return parsed dom back to backend 2022-05-06 12:29:08 +08:00
5f5076e864 Highlight code element without reinitialize jsdom 2022-05-06 12:20:54 +08:00
7c6b810522 Remove redundant JSDOM 2022-05-06 10:53:36 +08:00
6d405432af add site_name and site_icon to page model and return in resolver (#341)
* add site_name and site_icon to page model and return in resolver

* fix tests
2022-03-30 10:43:10 +08:00
ff1200f3a1 Use html decoding when getting values from fetched oembed
If we fetch oembed data from an external source, instead of
handling it in readabilityjs we need to html decode it.
2022-03-16 15:29:42 -07:00
2184c2a8d3 Parse online URLs for beehiiv newsletters 2022-03-07 15:49:44 -08:00
2cb5cc065a Add support for identifying newsletters hosted on beehiiv.com 2022-03-07 15:23:56 -08:00
b6fd3e786e Fix parsing authors from page metadata 2022-03-03 19:40:02 -08:00
49092b707d Remove async 2022-03-03 19:31:51 -08:00
05373ba3c7 add methid to parse content metadata 2022-03-03 17:10:06 -08:00
c2e08d0e8f Fetch title and author from page metadate if possible 2022-03-03 15:20:58 -08:00
b326a5f8e7 Add more matches on substack icons 2022-03-03 13:42:30 -08:00
484cd78ac5 prettier 2022-03-02 23:14:10 -08:00
65ce8353dc Attempt to pull URLs for probable newsletter emails out of content 2022-03-02 23:09:10 -08:00
65cc666579 Pass HTML instead of a JSDOM into isProbablyNewsletter to better encapsulate 2022-03-02 20:38:11 -08:00
9206230659 Better name for the save newsletter service 2022-03-02 19:45:28 -08:00
b5f9478350 Dont mutate function input 2022-03-02 16:34:56 -08:00
fc9aa9452c Add a flag in readability to retain table elements in newsletter emails (#152)
* add a flag in readability to retain table elements in newsletter emails

* remove header of axios newsletters
2022-03-01 11:49:38 +08:00
328ebc48cb Apply code block highlighting before running DOM clean 2022-02-26 19:04:32 -08:00
fd39923907 Prettier improvements 2022-02-26 16:38:43 -08:00
84fbc9cd27 Add code highlighting using highlight.js 2022-02-26 14:57:59 -08:00
9431e80be8 add parser 2022-02-17 20:17:02 +08:00
e85273d87a add support for axios newsletters (#49)
* add support for axios newsletters

* fix raw url

* fix getting newsletter url from axios

* add test for url

* add axios parser
2022-02-15 11:56:03 +08:00
84f32935f5 Open source omnivore 2022-02-11 09:24:33 -08:00