this post was submitted on 09 Jun 2023
2 points (100.0% liked)
Lemmy
12524 readers
1 users here now
Everything about Lemmy; bugs, gripes, praises, and advocacy.
For discussion about the lemmy.ml instance, go to !meta@lemmy.ml.
founded 5 years ago
Ok, thanks. My response times are way less than that unless there is some other issue.
I have been dealing with a lot of rate limit errors in the Lemmy backend, and that may have been my fault. I'm using proxy protocol in front of Nginx, and I didn't have the right variable set to populate the XFF header from that (`$remote_addr` vs `$proxy_protocol_addr`). I usually set those via an `include` for common proxy headers, but had to remove that include and add the lines manually as some of my defaults were breaking Lemmy UI.
So it may be possible my instance was erroneously rate-limiting the crawl, which contributed to that? I made the fix to that Nginx config about half an hour ago. I'll wait several hours and re-check the community browser to see if that was the cause.
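To illustrate why the wrong variable could cause this, here is a rough Python sketch. It is not Lemmy's actual rate limiter: `PerKeyLimiter`, `count_rejected`, the limit of 3, and all addresses are made up for the example. It only assumes that the backend limits per client IP as read from the XFF header, and that behind proxy protocol `$remote_addr` is the load balancer's address while `$proxy_protocol_addr` is the real client's.

```python
from collections import defaultdict


class PerKeyLimiter:
    """Toy limiter: allows at most `limit` requests per key (no time window)."""

    def __init__(self, limit):
        self.limit = limit
        self.counts = defaultdict(int)

    def allow(self, key):
        self.counts[key] += 1
        return self.counts[key] <= self.limit


def count_rejected(requests, xff_source, limit=3):
    """Count rejected requests when the XFF value is taken from `xff_source`."""
    limiter = PerKeyLimiter(limit)
    return sum(1 for req in requests if not limiter.allow(req[xff_source]))


# Hypothetical addresses: one load balancer, five distinct clients.
LOAD_BALANCER = "10.0.0.1"
requests = [
    {"remote_addr": LOAD_BALANCER, "proxy_protocol_addr": f"203.0.113.{i}"}
    for i in range(1, 6)
]

# XFF filled from $remote_addr: every client shares the load balancer's bucket.
rejected_with_remote_addr = count_rejected(requests, "remote_addr")

# XFF filled from $proxy_protocol_addr: each client gets its own bucket.
rejected_with_proxy_addr = count_rejected(requests, "proxy_protocol_addr")
```

With the first setting, all five requests count against a single address, so a crawler could be rejected because of unrelated traffic; with the second, none of them are.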
One last question: Is that crawler based on any kind of existing project I can pull and fork a new project from?
Thanks for your insight.
might be an option.
and the more instances link to yours, the more likely it will be crawled, in case other instances drop out.
i'd have to update the repo at codeberg, but right now it has low prio for me as i broke my hand and try to stay away from the keyboard ^^
I've got a lot of peers in my instances list (lemmy.ml, beehaw.org, feddit.de are the largest peers), and I've clicked into the `/instances` page of many of them; they all show my domain as a peer, so I believe I'm good there.
Well, I definitely thank you for powering through to answer my dumb questions, and I do appreciate your time.
thx, i'll send you a note if there is any update
@ptz@dubvee.org Seems your instance got crawled finally
There it is!
Thanks for checking back.
I want to say the blame lies 100% with me on this one.
When I moved over to the new domain, I forgot to disable my default WAF policy for `robots.txt`, which is set to disallow all. Realized that late yesterday evening and turned that rule off in my load balancer.
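For anyone hitting the same thing, a quick way to see what a disallow-all `robots.txt` does to a well-behaved crawler is Python's standard `urllib.robotparser`. This is only a sketch: the `is_allowed` helper, the user agent name, and the URL are placeholders, and it assumes the crawler honors `robots.txt` at all.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt, user_agent, url):
    """Return True if `user_agent` may fetch `url` under the given robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# What a "disallow all" policy serves, versus a permissive one.
DISALLOW_ALL = "User-agent: *\nDisallow: /\n"
ALLOW_ALL = "User-agent: *\nDisallow:\n"

# Placeholder crawler name and URL, for illustration only.
blocked = is_allowed(DISALLOW_ALL, "example-crawler", "https://example.org/instances")
permitted = is_allowed(ALLOW_ALL, "example-crawler", "https://example.org/instances")
```

Here `blocked` comes out `False` and `permitted` comes out `True`, which matches what happened: the instance was reachable, but crawlers were told to stay away.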