Commit Graph

56 Commits

Author SHA1 Message Date
db4f225b5c Merge pull request #1426 from omnivore-app/fix/github-handler-titles
Use greedy matching on the GitHub title regex
2022-11-17 13:42:23 +08:00
c18217e697 Fix logRecord being shared by requests 2022-11-17 11:28:46 +08:00
d9237723c5 Share puppeteer browser instance between content handlers 2022-11-17 11:28:46 +08:00
e866541ae1 Fix puppeteer launch in head mode 2022-11-17 11:28:46 +08:00
8bdbd86e97 Make headless = false in local testing 2022-11-17 11:27:51 +08:00
0d83ecb068 Use greedy matching on the GitHub title regex
This fixes issues with titles that have slashes (/) in their title
as the regex match would look for the last possible matching
slash.
2022-11-16 13:52:58 +08:00
c8febe900f Fix prettier error 2022-11-16 10:34:52 +08:00
726f69484f Better content extraction of title and description for GitHub URLs 2022-11-15 18:43:55 +08:00
9d753e3c0e Forward convertkit confirmation email 2022-11-02 15:27:30 +08:00
ae3ff6f34b Add test case 2022-11-01 17:48:39 +08:00
67082a478e Add support for hey world newsletters 2022-11-01 16:41:42 +08:00
8712a3efef Merge pull request #1373 from omnivore-app/fix/substack-embedded-tweets
Handle embedded tweets in substack emails
2022-11-01 15:04:03 +08:00
9b29a3c261 querySelectorAll returns a NodeList which we can iterate 2022-11-01 10:34:18 +08:00
392ac15a62 Treat all tweets as elements so we dont need to cast 2022-11-01 10:31:12 +08:00
a4f0698974 Handle older style substack image embeds as well as new 2022-10-31 21:57:29 +08:00
2622437831 Fix linting 2022-10-31 21:42:31 +08:00
cc91e43572 Handle embedded tweets in substack emails
This does a few things:
- tags static tweets found in substack emails with a special class
- upgrades readability to ignore special class names
- reduces some readability debug output
2022-10-31 21:28:36 +08:00
a7b208fd26 Skip non-author replies in twitter thread 2022-10-31 16:51:20 +08:00
1067747cc2 Add twitter author when saved 2022-10-27 19:02:16 +08:00
a536ed9b5e Save newer thread with puppeteer too 2022-10-27 14:20:20 +08:00
b556bc17e8 Set navigation timeout to be 60 seconds 2022-10-27 13:05:35 +08:00
6ca1e7dc25 Add no-sandbox in the puppeteer args 2022-10-27 12:49:00 +08:00
03777587f2 Fetch max 100 tweets in a thread 2022-10-26 22:44:19 +08:00
b5926ccf1c Get old tweet thread with puppeteer and new tweet with twitter api 2022-10-26 20:41:51 +08:00
5f0d0ed69b Get tweet reply ids from thread id using puppeteer 2022-10-25 22:09:16 +08:00
343c8fb5cc Get tweet reply ids from thread id 2022-10-25 18:25:49 +08:00
31bc9fe5be Fix node-weekly newsletter getting forwarded
* This error is caused by a previous rebase conflict and resolving the conflict caused an important line missing
* Revert the change to add CooperPressHandler to the array of newsletter handlers
* Updated the test to get the correct handler from email content to prevent such cases
2022-10-21 09:57:05 +08:00
706607ea2b Rebase main 2022-10-18 14:54:33 +08:00
1fda6946bb Save newsletters hosted by cooper-press.com correctly 2022-10-18 14:53:02 +08:00
d8c5c97960 Save newsletters hosted by ghost.org correctly 2022-10-17 19:27:02 +08:00
37439c6622 Save fs.blog newsletter correctly 2022-10-17 17:59:27 +08:00
ff21770a70 Add GitHubHandler class 2022-10-13 18:38:01 +08:00
f39c6923b0 Create a content handler for GitHub pages
This will search for the markdown <article> element and make it
the only page element so we add a repo's README.
2022-10-13 18:11:14 +08:00
5dcde8dac3 Fix not correctly identify milkroad newsletters due to the HTML format changes again by the author 2022-10-13 10:52:03 +08:00
08c30f9c24 Show the original tweet firstly 2022-10-10 15:33:36 +08:00
5d70d04154 If the saved tweet is a reply, we need to get the referenced tweet 2022-10-10 15:12:41 +08:00
220576235f Merge pull request #1291 from omnivore-app/sywhb-patch-1
Get tweet thread by conversation_id
2022-10-10 09:52:30 +08:00
8f048dd15e Change the milkroad newsletters parser according to the change of milkroad newsletter 2022-10-08 09:37:12 +08:00
06c1e09505 lint 2022-10-08 09:28:50 +08:00
2ff995d3b2 Get tweet thread by conversation_id 2022-10-08 09:09:18 +08:00
c9400ca517 Remove dom param in preHandler method 2022-10-07 16:57:01 +08:00
49ed8e279b Calls preParseContent function in content-handler package before parsing content 2022-10-07 16:56:38 +08:00
810df2ad8c Fix the tweets in reverse order and media file not showing 2022-10-07 16:31:56 +08:00
a8f964f615 Save only tweets in the thread by the original author 2022-10-07 13:47:20 +08:00
5bde463c94 Save unrolled recent twitter thread 2022-10-06 16:56:12 +08:00
61303419a0 Add timeout 2022-10-04 14:25:19 +08:00
b00a516737 Add other newsletter handlers 2022-09-30 12:51:23 +08:00
9b209314a6 Rename content to websites 2022-09-30 12:51:22 +08:00
f7b20f3186 Add newsletter handlers 2022-09-30 12:51:22 +08:00
dfbd317f99 Import content-handler in inbound-email-handler 2022-09-30 12:51:22 +08:00