• 1 Post
  • 23 Comments
Joined 1 year ago
cake
Cake day: March 4th, 2023

help-circle

  • It was just an example. The same can happen at the Mastodon-level instead of the Fediverse-level. Since there is some desired interop (e.g. between Mastodon and Lemmy), services do influence each other in their feature set.

    I’m not sure what you mean by “a lot of what people are worried about Threads doing has already been done by Mastodon”. Do you mean that the decisions that Mastodon make influence the rest of the Fediverse? If so, let’s make sure we understand the difference here: Threads has a much more hostile disposition. Mastodon seems to have incentives aligned with the rest of the Fediverse services, and probably deserves the benefit of the doubt; Facebook has abused that benefit time and time again.


  • That doesn’t actually fix the issue. If Facebook is trying to set itself up like Chrome with the webplatform, or GTalk with XMPP, then they will drive the feature set of ActivityPub, whether you’re federated with them or not.

    Hypothetical example:

    Want to see this picture/video from someone on Threads? You need Facebook’s proprietary picture format, which has DRM baked in it. Even if you don’t federate, Mastodon, Lemmy, etc now have to take energy away from their work to adopt the proprietary picture format. It depends on the proportion Threads takes on the network and how they can leverage that position to put pressure.

    Threads currently has voice notes. Should all ActivityPub services support that? If so, do we adhere to Threads’ standard or not?






  • Essentially, you don’t ask them to use their internal knowledge. In fact, you explicitly ask them not to. The technique is generally referred to as Retrieval Augmented Generation. You take the context/user input and you retrieve relevant information from the net/your DB/vector DB/whatever, and you give it to an LLM with how to transform this information (summarize, answer a question, etc).

    So you try as much as you can to “ground” the LLM with knowledge that you trust, and to only use this information to perform the task.

    So you get a system that can do a really good job at transforming the data you have into the right shape for the task(s) you need to perform, without requiring your LLM to act as a source of information, only a great data massager.








  • I don’t think that Sarah Silverman and the others are saying that the tech shouldn’t exist. They’re saying that the input to train them needs to be negotiated as a society. And the businesses also care about the input to train them because it affects the performance of the LLMs. If we do allow licensing, watermarking, data cleanup, synthetic data, etc. in a way that is transparent, I think it’s good for the industry and it’s good for the people.


  • That’s always been the case, though, imo. People had to make time for art. They had to go to galleries, see plays and listen to music. To me it’s about the fair promotion of art, and the ability for the art enjoyer to find art that they themselves enjoy rather than what some business model requires of them, and the ability for art creators to find a niche and to be able to work on their art as much as they would want to.