Lemmy Benchmarking - Working on tooling

vapeloki@lemmy.ml · 1 year ago

Lemmy Benchmarking - Working on tooling

RoundSparrow@lemmy.ml · 1 year ago

Also, Ratelimiting per IP is an issue.

I have concerns about Lemmy having a pattern of hiding these behaviors under the cover. In other words, people running servers having no kind of operator console to know that it is happening. Ideally to me, it would be a setting to adjust in a screen to set for an instance to disable/set threshold. If one doesn’t exist, maybe we can identify where in the code the limit is enforced and hand-edit the code.

RoundSparrow@lemmy.ml · edit-2 1 year ago

One of the big concerns I have is that there seems to be no sense of the problems being faced. The project was built around very little data for years, and growing pains abound.

As of today, lemmy.ml says this is the posting with the most comments (local), 852: https://lemmy.ml/post/1186515 This federated posting from Beehaw has over 1000: https://lemmy.ml/post/1265302

On Reddit, a “large” news event, such as the discovery of the Titanic submarine this week, can have 10,000 comments - https://old.reddit.com/r/news/comments/14g7ipn/debris_field_discovered_within_search_area_near/

And that isn’t even a major news breaking event on the order of a terrorist attack, Japan earthquake/nuclear incident, famous person being shot, etc.

vapeloki@lemmy.ml · 1 year ago

Yes, i see this issue also. I would assume that the statements used here, tend to get very bad plans, due to overhang (specific id’s will have far more entries then others).

This is one of the reasons for my current setup.

But when it comes to optimizing databases, i think i’m pretty skilled in it, and i have seen much worse scenarios (billing systems, processing > 100.000.000 entries per billing run, with tough time constraints).

RoundSparrow@lemmy.ml · 1 year ago

Just now they found out that Lemmy is falling over with 300 comment threads.

vapeloki@lemmy.ml · 1 year ago

Thx for the heads up. Now, we are talking database ;)

phiresky@lemmy.world · 1 year ago

Hey! I’m working on a rust tool right now to import a month of reddit dump into a lemmy instance using the federation api (for benchmarking / load testing as well).

vapeloki@lemmy.ml · 1 year ago

Nice! This would greatly help to populate the database and get much better results!

phiresky@lemmy.world · 1 year ago

my reddit import code is here for the time being: https://github.com/phiresky/lemmy/tree/reddit-importer

it works but is undocumented