Be advised; this thread is not for serious in depth discussion of weighty topics, this thread is not for anything Culture War related. This thread is for Fun. You got jokes? Share 'em. You got silly questions? Ask 'em.
- 104
- 4
What is this place?
This website is a place for people who want to move past shady thinking and test their ideas in a
court of people who don't all share the same biases. Our goal is to
optimize for light, not heat; this is a group effort, and all commentators are asked to do their part.
The weekly Culture War threads host the most
controversial topics and are the most visible aspect of The Motte. However, many other topics are
appropriate here. We encourage people to post anything related to science, politics, or philosophy;
if in doubt, post!
Check out The Vault for an archive of old quality posts.
You are encouraged to crosspost these elsewhere.
Why are you called The Motte?
A motte is a stone keep on a raised earthwork common in early medieval fortifications. More pertinently,
it's an element in a rhetorical move called a "Motte-and-Bailey",
originally identified by
philosopher Nicholas Shackel. It describes the tendency in discourse for people to move from a controversial
but high value claim to a defensible but less exciting one upon any resistance to the former. He likens
this to the medieval fortification, where a desirable land (the bailey) is abandoned when in danger for
the more easily defended motte. In Shackel's words, "The Motte represents the defensible but undesired
propositions to which one retreats when hard pressed."
On The Motte, always attempt to remain inside your defensible territory, even if you are not being pressed.
New post guidelines
If you're posting something that isn't related to the culture war, we encourage you to post a thread for it.
A submission statement is highly appreciated, but isn't necessary for text posts or links to largely-text posts
such as blogs or news articles; if we're unsure of the value of your post, we might remove it until you add a
submission statement. A submission statement is required for non-text sources (videos, podcasts, images).
Culture war posts go in the culture war thread; all links must either include a submission statement or
significant commentary. Bare links without those will be removed.
If in doubt, please post it!
Rules
- Courtesy
- Content
- Engagement
- When disagreeing with someone, state your objections explicitly.
- Proactively provide evidence in proportion to how partisan and inflammatory your claim might be.
- Accept temporary bans as a time-out, and don't attempt to rejoin the conversation until it's lifted.
- Don't attempt to build consensus or enforce ideological conformity.
- Write like everyone is reading and you want them to be included in the discussion.
- The Wildcard Rule
- The Metarule
Jump in the discussion.
No email address required.
Notes -
What are they training it on? Only a few hundred k, so presumably they're not throwing the entire e621 catalog at it yet.
From my understanding, the major trainers have largely downloaded a subset of e621 data, filtered by upvote score, and then by content. Furry and Yiffy to both SFW and less-extreme-kink NSFW, with different thresholds and limits. Zach3d on 'texture' fetish and a few specific species mixed with a small subset of general pictures with a higher upvote score. I think most have also filtered out material that they think is likely to cause artifacting, either technical stuff like severe jpg compression, or many-panel comic pieces.
So it's relatively easy to grab sets and train a model, to the point that groups of amateurs can do it? That's good news: I had a vision of all art production being censored by the kind of political commissars who inevitably take over large projects.
The idea of being able to restrict artists at the canvas level must be making some of them drool.
As far as I know, each of these datasets has been curated by one person, to their respective tastes. Hasuwoof for Yiffy, DirtyApples for Furry, and Zach for Zach3d. e621 is well-enough tagged for high-score posts that it seems fairly automatable, and as long as you're not abusing the download process, it's hard to tell a normal user from an archiver, especially if you filter before download. And the code itself is... not fun, since it's poorly documented python in most parts, but it's nothing ridiculous.
((There's a My Little Pony-specific one that's supposed to have been released recently, but I know less about that.))
There's been some discussion of setting up teams for difficult heavy lifting (eg, improving tagging, building and parsing datasets with more eyes-on-curation), but the big issue for now are cost and technical accessibility. The core model is expensive because it took literally millions of steps in a large dataset, but further tuning is relatively cheap, with most epochs taking less than a day on a single (beefy) cloud GPU server. But getting the data together and onto that machine rapidly enough can be complex to do right, and easy to end up with a staggering AWS bill if done wrong.
That'll be less an issue if newer GPU generations continue to bulk up on VRAM; if done at home, it's mostly an energy (and/or cooling) bill thing. And that might be coming as soon as this winter for people willing to splurge on the higher-VRAM versions of the 4090.
More options
Context Copy link
More options
Context Copy link
More options
Context Copy link
More options
Context Copy link