Archived
1
0
Fork 0
Commit graph

60 commits

Author SHA1 Message Date
244c9f68e7
refactor: move filters out of parser
Filters are now separate structures to include a product or not based
on their own set of properties. For now, include and exclude filters
are supported. They take a regex as an argument and include a product
if the regex matches (or doesn't match) the product name. This commit
will allow us to create new filters on product like on a price range.

Signed-off-by: Julien Riou <julien@riou.xyz>
2021-05-19 17:43:31 +02:00
b6feb2d656
Auto-remove stale products (#30)
Signed-off-by: Julien Riou <julien@riou.xyz>
2021-05-14 16:08:09 +02:00
d6ee2922d7
Create new thread when product is available again (#28)
Instead of spamming a twitter thread with generic replies confusing the
community because the original message was posted long time ago, the bot now
creates a new thread with all product information and an incrementing counter
for uniqueness.
2021-04-23 12:22:18 +02:00
c796cb20f2
Release 0.5.2
- Bugfix typo for compilation
- Handle Twitter duplicates (#20)
- Ignore *.bak files
- Bugfix asynchronous loop
- Add tests for utils.go
- Add MySQL support (#24)
- Bugfix Twitter foreign key
- Include chromium in docker-compose

Signed-off-by: Julien Riou <julien@riou.xyz>
2021-04-15 15:27:55 +02:00
75ca25ffbb
Bugfix typo for compilation
Used logrus.Logf instead of logrus.Printf.

Signed-off-by: Julien Riou <julien@riou.xyz>
2021-04-15 15:27:46 +02:00
1f5cef17d2
Handle Twitter duplicates (#20)
This commit adds a hash attribute to help identify duplicate messages. Tweets
have a TweetID attribute for the initial thread identifier and a LastTweetID
attribute to keep track of the last reply to eventually continue the thread if a
duplicate is detected.

Signed-off-by: Julien Riou <julien@riou.xyz>
2021-04-15 15:13:00 +02:00
9629483953
Ignore *.bak files
Signed-off-by: Julien Riou <julien@riou.xyz>
2021-04-15 08:40:39 +02:00
5ba0699adc
Bugfix asynchronous loop
When workers threshold was reached, the current parser was skipped instead of
being processed later. Adding a for loop to retry instead.

Signed-off-by: Julien Riou <julien@riou.xyz>
2021-04-12 09:21:24 +02:00
e57a1c0148
Add tests for utils.go
Signed-off-by: Julien Riou <julien@riou.xyz>
2021-04-12 08:49:11 +02:00
9f51c62776
Add MySQL support (#24)
Signed-off-by: Julien Riou <julien@riou.xyz>
2021-04-08 15:20:23 +02:00
d583d559d1
Bugfix Twitter foreign key
Signed-off-by: Julien Riou <julien@riou.xyz>
2021-04-07 08:46:09 +02:00
0e524b35b8
Include chromium in docker-compose
Signed-off-by: Julien Riou <julien@riou.xyz>
2021-04-07 08:38:53 +02:00
9b6d3cf745
Release 0.5.1
- Add PostgreSQL support (#23)
- Keep track and remove stale products
- Select or create shop before parsing
- Remove auto increment from primary key
- Enable replies on Twitter and Telegram

Signed-off-by: Julien Riou <julien@riou.xyz>
2021-04-06 10:35:04 +02:00
47c8a8ace9
Add PostgreSQL support (#23)
Signed-off-by: Julien Riou <julien@riou.xyz>
2021-04-06 10:33:24 +02:00
b6f23d4bd5
Unify case in help
Signed-off-by: Julien Riou <julien@riou.xyz>
2021-04-06 09:25:15 +02:00
81258ed935
Keep track and remove stale products
Products not updated since a while are not supposed to stay in the database nor
exposed via the API. This commit automatically updates all detected products
with the current date and adds a "retention" flag (number of days) to remove old
products. This flag is disabled by default.

Signed-off-by: Julien Riou <julien@riou.xyz>
2021-04-06 09:22:22 +02:00
7c2c1032cb
Make the linter happy
Signed-off-by: Julien Riou <julien@riou.xyz>
2021-04-01 17:57:17 +02:00
ab5abcd171
Select or create shop before parsing
A shop map was created to group URLs by shops and process them in order. Now
that we have Amazon and each URL can be parsed independently, there is no need
to group them anymore. Moreover, shops were passed as an argument to the
handleProducts function. Shop name can be deduced by the parser itself. The
parser has a reference to the database. The parser now select or create the shop
before parsing products.

Signed-off-by: Julien Riou <julien@riou.xyz>
2021-04-01 17:50:50 +02:00
5f69b8435b
Remove auto increment from primary key
Because it's implicit with gorm.

Signed-off-by: Julien Riou <julien@riou.xyz>
2021-04-01 17:13:17 +02:00
305b3eeb76
Enable replies on Twitter and Telegram
By default, when a product is available, a notification is sent. When that same
product is not available, a reply is sent to the original message. With tons of
notifications, replies might be seen as flooding. This commit adds an option to
explicitly enable replies on Twitter and Telegram notifiers. By default, reply
messages are disabled.

Signed-off-by: Julien Riou <julien@riou.xyz>
2021-04-01 13:14:00 +02:00
26eb0af9ea
Release 0.5.0
- Add Amazon support (#3)

Signed-off-by: Julien Riou <julien@riou.xyz>
2021-04-01 13:02:52 +02:00
5ac5f78ae2
Add Amazon support (#3)
This commit introduces the Amazon support with calls to the Product Advertising
API (PA API). For now, I was only able to use the "www.amazon.fr" marketplace.
I will add more marketplaces when my Amazon Associate accounts will be
validated.

Signed-off-by: Julien Riou <julien@riou.xyz>
2021-04-01 12:59:16 +02:00
f994093baf
Release 0.4.0
- Add HTTP API

Signed-off-by: Julien Riou <julien@riou.xyz>
2021-03-30 18:01:54 +02:00
17a88265c6
Add HTTP API
Add `-api` mode to start the HTTP API with the following routes:
- /health
- /shops
- /shops/:id
- /products
- /products/:id

Signed-off-by: Julien Riou <julien@riou.xyz>
2021-03-30 18:00:12 +02:00
3122e59325
Release 0.3.1
- Use markdown in Telegram

Signed-off-by: Julien Riou <julien@riou.xyz>
2021-03-30 08:59:39 +02:00
9ef7ac928e
Use markdown in Telegram
Signed-off-by: Julien Riou <julien@riou.xyz>
2021-03-24 12:51:58 +01:00
93a37f2f89
Release 0.3.0
- Add Telegram Messenger notifications
- Prepare for new parsers

Signed-off-by: Julien Riou <julien@riou.xyz>
2021-03-24 09:37:13 +01:00
0a9ae46d19
Add Telegram Messenger notifications
Signed-off-by: Julien Riou <julien@riou.xyz>
2021-03-24 09:35:37 +01:00
e67ab63ca8
Prepare for new parsers
- Rename "Parser" to "URLParser"
- Make "Parse" function generic
- Rename "crawlShop" function to "handleProducts"
- Reduce "handleProducts" footprint a little bit

Signed-off-by: Julien Riou <julien@riou.xyz>
2021-03-23 09:00:10 +01:00
9269d59380
Release 0.2.3
- Add Versus Gamers support (#12)
- Add STEG support (#10)
- Update the Twitter reply
- Add Dockerfile

Signed-off-by: Julien Riou <julien@riou.xyz>
2021-03-23 07:58:44 +01:00
2681e4a427
Add Versus Gamers support (#12)
Signed-off-by: Julien Riou <julien@riou.xyz>
2021-03-19 17:03:11 +01:00
42b8f50068
Add STEG support (#10)
Signed-off-by: Julien Riou <julien@riou.xyz>
2021-03-16 15:29:28 +01:00
77230bd0a4
Update the Twitter reply
Twitter reply message updated to comply with the "And It's Gone" meme from 2009:
https://knowyourmeme.com/memes/and-its-gone

Signed-off-by: Julien Riou <julien@riou.xyz>
2021-03-12 12:01:07 +01:00
21e40e9079
Add Dockerfile
Signed-off-by: Julien Riou <julien@riou.xyz>
2021-03-09 10:59:26 +01:00
f7d10a00e5
Release 0.2.2
- Add newegg support (#14)
- Truncate too long product names for Twitter
- Configure headless browser address
- Bugfix include and exclude regexes

Signed-off-by: Julien Riou <julien@riou.xyz>
2021-03-06 07:54:14 +01:00
e47795baf1
Remove leftover query file
Signed-off-by: Julien Riou <julien@riou.xyz>
2021-03-02 12:39:18 +01:00
384ad0beef
Add newegg support (#14)
Signed-off-by: Julien Riou <julien@riou.xyz>
2021-03-02 12:37:12 +01:00
92b991c042
Add newegg support (#14)
Signed-off-by: Julien Riou <julien@riou.xyz>
2021-03-02 09:02:23 +01:00
443fbd9106
Remove useless comment in Twitter tests
Signed-off-by: Julien Riou <julien@riou.xyz>
2021-03-02 09:01:18 +01:00
9692da6852
Truncate too long product names for Twitter
Signed-off-by: Julien Riou <julien@riou.xyz>
2021-03-02 08:56:47 +01:00
45025def65
Configure headless browser address
Add `browser_address` configuration setting to define where is the headless
browser instead of relying on the default value.

Signed-off-by: Julien Riou <julien@riou.xyz>
2021-03-01 13:11:58 +01:00
6f002f007d
Bugfix include and exclude regexes
Signed-off-by: Julien Riou <julien@riou.xyz>
2021-03-01 09:06:00 +01:00
14576cde0e
Release 0.2.1
Features:
 - Add Micro Center support (#2)
 - Replace strings with regexp for Twitter hashtags

Bugfixes:
 - Use array of key/value for Twitter hashtags (#1)

Cleanup:
 - Remove useless compileRegex function

Signed-off-by: Julien Riou <julien@riou.xyz>
2021-02-28 15:35:10 +01:00
0902b13705
Add USD currency to price format on Twitter
Signed-off-by: Julien Riou <julien@riou.xyz>
2021-02-28 15:33:59 +01:00
2afd36584b
Add store support for Micro Center (#2)
Local stores are set with an "storeid" param in the query string of the URL
and by a "storeSelected" cookie to avoid garbage in the query strings for
further requests. As product URL is a unique key in the database, Micro Center
is able to handle an URL with the store ID for every product. We can add this
storeid in the list of URLs to parse and job done. Every single Micro Center
local store are parsable.

Signed-off-by: Julien Riou <julien@riou.xyz>
2021-02-28 15:11:31 +01:00
42f79c03d4
Add Micro Center support (#2)
As a good start, only the "shippable items" are parsed. Next enhancement would
be to configure local shops.

Signed-off-by: Julien Riou <julien@riou.xyz>
2021-02-28 10:33:28 +01:00
5e19efbad8
Replace strings with regexp for Twitter hashtags
Signed-off-by: Julien Riou <julien@riou.xyz>
2021-02-28 09:45:19 +01:00
4342f65211
Use array of key/value for Twitter hashtags (#1)
With Go, maps are not ordered the way they are declared. Keys can be read at any
order. When a pattern is too large ("rtx 3060"), when placed first, it can match
a name of another product ("rtx 3060 ti"). When placed second, the good hashtag
is chosen. This commit uses an array of maps because arrays are ordered.

Signed-off-by: Julien Riou <julien@riou.xyz>
2021-02-27 18:34:13 +01:00
40b0ba999a
Remove useless compileRegex function
Signed-off-by: Julien Riou <julien@riou.xyz>
2021-02-27 15:14:21 +01:00
3a4aba93e5
Release 0.2.0
- new language: go
- new shops: cybertek.fr, mediamarkt.ch
- deprecated shops: alternate.be, minershop.eu
- improved database transaction management
- better web parsing library (ferret, requires headless chrome browser)
- include or exclude products by applying regex on their names
- check for PID file to avoid running the bot twice
- hastags are now configurable

Signed-off-by: Julien Riou <julien@riou.xyz>
2021-02-27 08:10:43 +01:00