Notices by feld (feld@bikeshed.party), page 4

feld (feld@bikeshed.party)'s status on Wednesday, 17-Jul-2024 09:01:38 JST feld
in reply to
@mint relays being used too?

In conversation about 6 months ago from bikeshed.party permalink
feld (feld@bikeshed.party)'s status on Wednesday, 17-Jul-2024 00:20:50 JST feld
in reply to
@mint if it's interesting to you, I forked the Oban Lifeline plugin and made a new one called Lazarus which is configured to revive a stuck/dead/orphaned job even if it was at its last attempt due to max_attempts: 1

if the job has failed multiple times though, it lets it go

the original Lifeline plugin would throw away a bunch of our jobs just because they're max_attempts: 1 and it's not super necessary to have them tried again, just dumb that if they failed hard for a totally unhandled reason it wouldn't try again

https://git.pleroma.social/pleroma/elixir-libraries/oban_plugins_lazarus
In conversation about 6 months ago from bikeshed.party permalink
Attachments
1. Pleroma / Elixir libraries / oban_plugins_lazarus · GitLab
  
  Pleroma GitLab
feld (feld@bikeshed.party)'s status on Tuesday, 16-Jul-2024 02:26:31 JST feld
in reply to
@mint I found it's possible for background queue to get stuck because of super long timeout (15 mins) and some other jobs which were missing timeouts (defaults to infinity), so I've fixed these issues. Some other tweaks in here too.

These changes do not have anything directly to do with the ReceiverWorker, but it may be possible that Oban is not scheduling those jobs because of existing running jobs being stuck. This is unclear to me and doesn't feel like it should work that way in the BEAM, so it could be an Oban-specific behavior with how it is choosing to execute available work.

Investigation is still ongoing until I am certain nothing else could be causing this.

https://git.pleroma.social/pleroma/pleroma/-/merge_requests/4176
In conversation about 6 months ago from bikeshed.party permalink
Attachments
1. Oban improvements (!4176) · Merge requests · Pleroma / pleroma · GitLab
  
  add missing Oban timeouts fix some jobs returning :error instead of :cancel (due to disabled features or other conditions) move PurgeExpiredActivity to background queue...
feld (feld@bikeshed.party)'s status on Saturday, 13-Jul-2024 03:36:14 JST feld
in reply to
@mint I am actively investigating this, trying to find any possible reason this is happening.

My best guess so far is orphaned jobs making Oban think it can't run more jobs because they're dead / stuck in "executing" state.

This should really never happen because Oban itself doesn't crash, but I guess if you restarted Pleroma and it didn't clean itself up gracefully this could happen.

Any chance some of these are Docker deployments or the service could have crashed and restarted automatically due to low resources (OOM, etc)?

In conversation about 6 months ago from bikeshed.party permalink
feld (feld@bikeshed.party)'s status on Friday, 12-Jul-2024 23:30:08 JST feld
in reply to
@mint the other queues were reasonably sized ?

In conversation about 6 months ago from bikeshed.party permalink
feld (feld@bikeshed.party)'s status on Friday, 12-Jul-2024 16:59:30 JST feld
in reply to
@mint In that issue you said it's the ReceiverWorker queue getting stuck and just piling up with jobs, right?

In conversation about 6 months ago from bikeshed.party permalink
feld (feld@bikeshed.party)'s status on Tuesday, 09-Jul-2024 22:13:55 JST feld
in reply to
@tusooa @lain @lanodan @hj @izaya

I think it's possible and we should support it.

Phoenix channels, sessions, etc are multi node aware

Oban is multi node aware

Cachex can be made multi-node aware

ConfigDB changes could be made into broadcasts

Other things could probably be solved with Singleton

Supporting this should finally give us Pleroma 3.0

In conversation about 6 months ago from bikeshed.party permalink
feld (feld@bikeshed.party)'s status on Tuesday, 09-Jul-2024 02:09:28 JST feld
in reply to
@mint

> Hackney not being able to follow redirects when using HTTP proxy.

i know that's unrelated, but can you send me a diff of that ?

also very curious if this problem exists (the main issue, not redirects) when you use Gun instead of Hackney

In conversation about 6 months ago from bikeshed.party permalink
feld (feld@bikeshed.party)'s status on Tuesday, 09-Jul-2024 01:59:30 JST feld
in reply to
@mint you're running latest-ish develop branch and using the default Tesla HTTP client (Hackney) I presume?

In conversation about 6 months ago from bikeshed.party permalink
feld (feld@bikeshed.party)'s status on Tuesday, 09-Jul-2024 00:46:49 JST feld
in reply to
@mint I'd look through the args column of these that are mysteriously timing out, grab an activity/object URL, and manually try to run:

Pleroma.Object.Fetcher.fetch_object_from_id("URL_HERE")

In conversation about 6 months ago from bikeshed.party permalink
feld (feld@bikeshed.party)'s status on Tuesday, 09-Jul-2024 00:19:48 JST feld
in reply to
- feld
@mint would be nice to see what the errors are with something like...

SELECT errors FROM oban_jobs
WHERE worker = 'Pleroma.Workers.ReceiverWorker'
AND state = 'executing'
AND attempt > 1;

In conversation about 6 months ago from bikeshed.party permalink
feld (feld@bikeshed.party)'s status on Tuesday, 09-Jul-2024 00:19:13 JST feld
in reply to
@mint I'm misunderstanding the problem; what's happening with the server?

edit: ReceiverWorker high job count meaning you're seeing federation halt?

In conversation about 6 months ago from bikeshed.party permalink
feld (feld@bikeshed.party)'s status on Monday, 08-Jul-2024 23:42:46 JST feld
in reply to
@mint you mean you're seeing high binary memory usage?

In conversation about 6 months ago from bikeshed.party permalink
feld (feld@bikeshed.party)'s status on Friday, 05-Jul-2024 23:20:15 JST feld
in reply to
@lina @creamqueen @mint @coolboymew it's not a substitute; it just works differently.

butter, margarine, and lard produce completely different pie crust textures. (butter flaky, lard gritty, marg somewhere inbetween)

In conversation about 7 months ago from bikeshed.party permalink
feld (feld@bikeshed.party)'s status on Friday, 05-Jul-2024 23:18:05 JST feld
in reply to
@mint @creamqueen @coolboymew if it was good enough for Napoleon it's good enough for me

In conversation about 7 months ago from bikeshed.party permalink
feld (feld@bikeshed.party)'s status on Friday, 05-Jul-2024 23:17:24 JST feld
in reply to
- cream queen
- cool_boy_mew
@creamqueen @coolboymew dairy fats are punk as fuck

In conversation about 7 months ago from bikeshed.party permalink
feld (feld@bikeshed.party)'s status on Friday, 05-Jul-2024 23:16:55 JST feld
in reply to
- cream queen
- cool_boy_mew
@creamqueen @coolboymew I almost always have butter sitting out. I used to have a butter crock but I lost it when I moved
In conversation about 7 months ago from bikeshed.party permalink
Attachments
1. Untitled attachment
  https://media.bikeshed.party/pleroma/b3f98571755592af17fa9d880605cb2732c1eb8e46d97540966a88cdbd1744fd.png
2. Untitled attachment
  https://media.bikeshed.party/pleroma/2fb8f00fd02eb1005847238277eb8bad23c997b6a6f96643338b93bbf724bfa4.png
feld (feld@bikeshed.party)'s status on Friday, 05-Jul-2024 23:16:54 JST feld
in reply to
- cream queen
- cool_boy_mew
@creamqueen @coolboymew btw if you only use it for cooking, buy unsalted butter. It works better. You'll have to compensate with a little salt in your recipes, but salted butter retains water which is not what you want when you're cooking/baking/frying.

In conversation about 7 months ago from bikeshed.party permalink
feld (feld@bikeshed.party)'s status on Tuesday, 02-Jul-2024 22:51:41 JST feld
in reply to
@mint aha! Delete those

DELETE FROM oban_jobs WHERE worker = 'Pleroma.Workers.UserRefreshWorker';

I guess they pulled develop branch between my change and the fix for that 😭 I set those jobs to be unique and the unique setting wasn't being honored

In conversation about 7 months ago from bikeshed.party permalink
feld (feld@bikeshed.party)'s status on Tuesday, 02-Jul-2024 22:51:15 JST feld
in reply to
- :blank:
@i @mint I think it could be reasonably common for there to be stale "executing" jobs in the table that are left indefinitely and should be recycled due to them remaining from a crash/failure/unclean shutdown. I check mine occasionally but so far I haven't found any.

In conversation about 7 months ago from bikeshed.party permalink

After
Before

Public

Notices by feld (feld@bikeshed.party), page 4

User actions

Following 0

Followers 0

Groups 0

Statistics

Feeds