Frédéric Guillot
3afdf25012
Do not proxy image data url
2020-10-14 22:26:54 -07:00
Frédéric Guillot
31435ef83e
Add rewrite rule to fix Medium.com images
2020-09-29 22:27:32 -07:00
Frédéric Guillot
d75ff0c5ab
Add sanitizer support for responsive images
...
- Add support for picture HTML tag
- Add support for srcset, media, and sizes attributes to img and source tags
2020-09-28 23:22:08 -07:00
Frédéric Guillot
c394a61a4e
Add Prometheus exporter
2020-09-27 20:04:48 -07:00
Frédéric Guillot
16b7b3bc3e
http client: remove dependency on global config options
2020-09-27 14:37:46 -07:00
Dave Marquard
eb026ae4ac
handle Pacific Daylight Time in addition to Pacific Standard Time
2020-09-22 19:47:36 -07:00
Frédéric Guillot
0d0395b4e3
Do not try to update a duplicated feed after a refresh
2020-09-20 23:42:18 -07:00
Frédéric Guillot
e6c6ee441a
Use a transaction to refresh and create entries
...
Also includes few database improvements:
- Speed up entries clean up with an index and a goroutine
- Avoid the accumulation of enclosures for some feeds
2020-09-20 23:12:23 -07:00
Frédéric Guillot
bfb96d536e
Add workaround for parsing an invalid date
2020-09-14 21:23:26 -07:00
Kebin Liu
cf7712acea
Add HTTP proxy option for subscriptions
2020-09-09 23:28:54 -07:00
alex
0f258fd55b
Make add_invidious_video rule applicable for different invidious instances
2020-09-06 13:41:42 -07:00
Frédéric Guillot
fc75b0cd8e
Add workaround to get YouTube feed from video page
2020-08-02 12:24:46 -07:00
Frédéric Guillot
7380c64141
Add workaround to find YouTube channel feeds
...
YouTube doesn't expose RSS links anymore for new-style URLs.
2020-08-02 11:37:07 -07:00
Frédéric Guillot
1d6b0491a7
Ignore <media:title> in RSS 2.0 feeds
...
In the vast majority of cases, the default entry title is correct.
Ignoring <media:title> avoid overriding the default title if they are different.
2020-06-29 18:24:06 -07:00
Gabriel Augendre
e44b4b2540
Try known urls if no link alternate
...
I came across a few blogs that didn't have a link rel alternate
but offered a RSS/Atom feed.
This aims at solving this issue for "well known" feed urls, since
these urls are often the same.
2020-06-21 20:34:59 -07:00
Manuel Müller
ca918bc7e3
Added scraper rule for dilbert.com and turnoff.us
2020-06-10 20:15:46 -07:00
Frédéric Guillot
6c6ca69141
Add feed option to ignore HTTP cache
2020-06-05 22:04:52 -07:00
Frédéric Guillot
7e5157f218
Rename alternative scheduler to entry_frequency
2020-05-25 15:12:47 -07:00
Shizun Ge
cead85b165
Add alternative scheduler based on the number of entries
2020-05-25 14:06:56 -07:00
Corey McCaffrey
25d4b9fc0c
Added scraper rule for financialsamurai.com
...
The default rule results in blank content.
2020-05-24 13:29:28 -07:00
Corey McCaffrey
0683074b8b
Added scraper rule for TheOatmeal.com
...
The default rule does not show the comic posted to the feed. The comic image is in a div with id "comic".
2020-05-13 21:28:00 -07:00
Corey McCaffrey
8f6c07afd6
Added scraper rule for RayWenderlich.com
...
RayWenderlich.com is a popular developer's community for iOS and Android developers. The default rule results in "GROUP GROUP GROUP GROUP…" instead of the content posted on the blog.
2020-05-13 21:28:00 -07:00
Frédéric Guillot
619aa58fb3
Handle more invalid dates
...
Fixes #617
2020-04-25 20:15:18 -07:00
Frédéric Guillot
592151bdb6
Add support for Invidious
...
- Embed Invidious player for invidio.us feeds
- Add new rewrite rule to use Invidious player for Youtube feeds
2020-03-20 20:56:59 -07:00
Andrew Williams
9974e0f458
Addition of scraper rule for wdwnt.com
...
By default fetching original content for wdwnt.com results in a snippet of the comments section, this rule captures the article content.
2020-02-28 20:24:58 -08:00
Frédéric Guillot
997e9422eb
Ignore enclosures without URL
2020-01-30 21:18:49 -08:00
Frédéric Guillot
61f0c8aa66
Allow application/xhtml+xml links as comments URL in Atom replies
2020-01-04 16:07:06 -08:00
Frédéric Guillot
bf632fad2e
Allow only absolute URLs in comments URL
...
Some feeds are using invalid URLs (random text).
2020-01-04 15:54:16 -08:00
Kebin Liu
8cebd985a2
Use internal XML workarounds to detect feed format
2020-01-02 22:19:15 -08:00
Frédéric Guillot
ac3c936820
Make sure whitelisted URI schemes are handled properly by the sanitizer
2020-01-02 11:03:51 -08:00
Frédéric Guillot
3debf75eb9
Normalize URL query string before executing HTTP requests
...
- Make sure query strings parameters are encoded
- As opposed to the standard library, do not append equal sign
for query parameters with empty value
- Strip URL fragments like Web browsers
2019-12-26 15:56:59 -08:00
Frédéric Guillot
200b1c304b
Improve Dublin Core support for RDF feeds
2019-12-23 14:45:58 -08:00
Frédéric Guillot
1b33bb3d1c
Improve Podcast support (iTunes and Google Play feeds)
...
- Add support for Google Play XML namespace
- Improve existing iTunes namespace implementation
2019-12-23 13:51:42 -08:00
Frédéric Guillot
33fdb2c489
Add support for Atom 0.3
2019-12-22 22:42:00 -08:00
Frédéric Guillot
cfb6ddfcea
Add support for Atom 'replies' link relation
...
Show comments URL for Atom feeds as per RFC 4685.
See https://tools.ietf.org/html/rfc4685#section-4
Note that only the first link with type "text/html" is taken into consideration.
2019-12-22 18:03:04 -08:00
cinput
8e1ed8bef3
Return outer HTML when scraping elements
2019-12-21 21:18:31 -08:00
somini
30f22fbd78
Update scraper rule for "Le Monde"
2019-12-19 18:35:29 -08:00
Jebbs
a155ab6deb
Filter valid XML characters for UTF-8 XML documents before decoding
...
This change should reduce "illegal character code" XML errors.
2019-12-19 18:31:52 -08:00
Frédéric Guillot
a4ebb33cd5
Trim spaces for RDF entry links
2019-12-01 15:06:01 -08:00
Frédéric Guillot
120d6ec7d8
Do no rewrite Youtube description twice in "add_youtube_video" rule
...
This is already done before in <media:description>.
2019-11-30 22:56:06 -08:00
Frédéric Guillot
69aa650203
Add the possibility to add rules during feed creation
2019-11-29 11:27:58 -08:00
Frédéric Guillot
912a98788e
Add support of media elements for Atom feeds
2019-11-28 23:55:40 -08:00
Frédéric Guillot
f90e9dfab0
Add support of media elements for RSS 2 feeds
2019-11-28 21:33:32 -08:00
Frédéric Guillot
c43c9458a9
Add rewrite functions: convert_text_link and nl2br
2019-11-28 21:33:12 -08:00
Neo Ng
90064a8cf0
Update scraper rule for openingsource.org
2019-11-28 19:40:26 -08:00
Tony Wang
2eb2441f2b
Improve XML decoder to remove illegal characters
2019-10-22 20:32:35 -07:00
Tony Wang
5517eebafe
Add new formats to date parser
2019-10-20 09:52:18 -07:00
Frédéric Guillot
36d7732234
Disable strict XML parsing
...
This change should improve parsing of broken XML feeds.
See https://golang.org/pkg/encoding/xml/#Decoder
2019-09-18 22:45:56 -07:00
Frédéric Guillot
934385ff55
Replace Travis by GitHub Actions
2019-09-15 11:48:15 -07:00
Frédéric Guillot
8d8f78241d
Add native lazy loading for images and iframes
...
This feature is available only in Chrome >= 76 for now.
See https://web.dev/native-lazy-loading
2019-09-10 21:22:19 -07:00