My job is castle!
I believe your “They use attention mechanisms to figure out which parts of the text are important” is just a restatement of my “break it into contextual chunks”, no?
Large language models literally do subspace projections on text to break it into contextual chunks, and then memorize the chunks. That’s how they’re defined.
Source: the paper that defined the transformer architecture and the formulas behind large language models, which has been cited more than 85,000 times in academic sources alone: https://arxiv.org/abs/1706.03762
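For anyone who wants to see what the attention step actually computes, here is a minimal sketch of the scaled dot-product attention formula from that paper, softmax(QK^T / sqrt(d_k)) V, in plain Python. The toy token vectors and the function names are made up for illustration, and the learned projection matrices that the real architecture applies to get Q, K and V are left out, so treat it as a rough picture and not how any actual model is implemented.

```python
import math

def softmax(xs):
    # Subtract the max before exponentiating for numerical stability.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def attention(queries, keys, values):
    """Scaled dot-product attention over lists of equal-length vectors."""
    d_k = len(keys[0])
    outputs, all_weights = [], []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
                  for k in keys]
        # Turn the scores into weights that sum to 1.
        weights = softmax(scores)
        # Output is the weighted average of the value vectors.
        out = [sum(w * v[j] for w, v in zip(weights, values))
               for j in range(len(values[0]))]
        outputs.append(out)
        all_weights.append(weights)
    return outputs, all_weights

# Hypothetical 2-dimensional "token" vectors, used as Q, K and V at once
# (self-attention without the learned projections).
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
outputs, weights = attention(tokens, tokens, tokens)

for row in weights:
    print([round(w, 3) for w in row])
```

Each printed row shows how much one token attends to every token in the sequence. The weights in a row always sum to 1, and tokens whose vectors point in a similar direction get the larger share.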
Yeah, I agree. However, Battle Tendency may be hard to jump into if somebody skipped Part 1.
People like to bring up the first Jojo parts as an example of skippable content, but I also want to emphasize that Parts 1 & 2 combined (26 episodes) are barely longer than half of Part 3 (48 episodes), so it’s hardly skipping anything since there are 6 animated parts.
What too much NCD does to a mf
bruh
Yeah, but Facebook probably has access to the other person’s contacts, where your name and phone number are stored.
I heard it in a coffee shop just the other day. Several customers and employees complained, and the manager skipped the song, all within about 30 seconds.