The catarrhine who invented a perpetual motion machine, by dreaming at night and devouring its own dreams through the day.

  • 9 Posts
  • 899 Comments
Joined 10 months ago
Cake day: January 12th, 2024




  • When it comes to how people feel about AI translation, there is a definite distinction between utility and craft. Few object to using AI in the same way as a dictionary, to discern meaning. But translators, of course, do much more than that. As Dawson puts it: “These writers are artists in their own right.”

    That’s basically my experience.

    LLMs are useful for translation in three situations:

    • declension/conjugation tables: faster than checking a dictionary
    • listing potential translations for a word or expression
    • a second round of spelling/grammar proofing, to catch the issues that you missed

    Past that, LLM-based translations are a sea of slop: they screw up the tone and style, add stuff not present in the original, repeat sentences, remove critical bits, pick unsuitable synonyms, and so on. All the bloody time.

    And if you’re handling dialogue, they will fuck it up even in shorter excerpts, by making all characters sound the same.




  • The drop is slowing down considerably:

    Month   Users   Change from previous month   Change in %
    Mar     53687   N/A                          N/A
    Apr     51298   -2389                        -4.5%
    May     48832   -2466                        -4.8%
    Jun     48472   -360                         -0.74%
    Jul     47297   -1175                        -2.4%
    Aug     47876   +579                         +1.2%
    Sep     47227   -649                         -1.4%
    Oct     45037   -2190                        -4.6%
    Nov     44837   -200                         -0.44%
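
    If you want to double-check the arithmetic, here is a minimal Python sketch. The user counts are copied from the table above; the names `users` and `monthly_changes` are mine, just for illustration. Each change is the difference from the previous month, and the percentage is that difference divided by the previous month's count, so the results should match the table up to rounding.

```python
# Monthly active users, copied from the table above.
users = [
    ("Mar", 53687), ("Apr", 51298), ("May", 48832),
    ("Jun", 48472), ("Jul", 47297), ("Aug", 47876),
    ("Sep", 47227), ("Oct", 45037), ("Nov", 44837),
]


def monthly_changes(data):
    """Return (month, absolute change, change in %) for every month after the first."""
    rows = []
    # Pair each month with the one right before it.
    for (_, prev), (month, cur) in zip(data, data[1:]):
        diff = cur - prev
        rows.append((month, diff, 100 * diff / prev))
    return rows


if __name__ == "__main__":
    for month, diff, pct in monthly_changes(users):
        print(f"{month}: {diff:+d} ({pct:+.2f}%)")
```

    The percentage is relative to the previous month rather than to the March peak, which is why the same absolute drop weighs slightly more as the userbase shrinks.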

    And given that March was a peak, I’m tempted to interpret it as newbies not sticking around. I think that it’ll plateau around 40k users and, provided that the conditions remain the same, stay there without increasing or decreasing much.

    That’s why I say that it’s stable - the core userbase will likely stick around.

    That said, these numbers may particularly be bad, e.g. if anyone left Lemmy and went to Mbin and/or PieFed, then I think they would not be counted in those charts?

    They wouldn’t be counted but I don’t think that this introduces a lot of inaccuracy. Mbin has 1.7k MAUs, and PieFed has 104.

    The number of instances dropping is far more concerning IMO. It means that smaller instances have a hard time becoming sustainable.


  • Just as additional info, as this doesn’t answer your question (sorry!):

    I’ve seen constructions like this popping up in other languages, under different names. In Portuguese for example it’s called “pronome de interesse” (pronoun of interest) or “dativo ético” (ethical dative). Often used alongside commands, like this:

    • Por favor, não [me]¹ conte [para os outros]² [o que aconteceu]³.
    • Please, [for my sake]¹, don’t tell [the others]² [what happened]³.

    I’ve tagged 1 = the ethical dative, 2 = the indirect object, and 3 = the direct object. Since the verb (contar, to tell) already has its two objects, that ⟨me⟩ cannot be an object required by the verb.

    It’s typically associated with informal speech, but attested across multiple dialects (see e.g. this and this). And apparently it traces all the way back to Latin.

    German and Ancient Greek also show the same phenomenon.

    Based on that I’d guess that what you’re seeing in English is a leftover of some really old feature, so it’ll probably surface across multiple dialects, even if Dixie English sticks with it a bit more.







  • I don’t think that handedness plays a huge role. I think that in some cases it’s simply random, and in other cases it’s “we write in this direction because that’s how we learned it”.

    Ink writing has existed since at least 2500 BCE; it was already used with hieroglyphs, and yet you see those being written left to right, right to left, boustrophedon. It’s a mess. Even with the Greek alphabet, people only stopped using boustrophedon so much around 300 BCE or so.

    Plus, if it played a role, we’d see the opposite of what we see today: the Arabic abjad clearly evolved among people who wrote with ink (that’s why it’s so cursive), and yet it runs right to left. Meanwhile the customary writing medium for Latin, which runs left to right, was wax tablets, where smudging ink is no issue.


  • As others said it was a conscious decision of the developers, as it’s gamification of the system and they aren’t big fans of that.

    I agree with this decision.

    The Fluff Principle* makes easy-to-judge content get higher scores, and we do see it on Lemmy. It isn’t a big deal because fluff ends up in its own specific comms, but once you gamify the accumulation of score points, the picture changes: now you’re encouraging people to share content that they expect to score high over content that they believe to be contributive.

    Additionally, publicly visible karma enables a bunch of poorly thought-out mod practices, like karma gating (“you need +500 karma to post here lol”) or automatically banning people with low karma (even if it might come from a single post/comment).

    *“Hence what I call the Fluff Principle: on a user-voted news site, the links that are easiest to judge will take over unless you take specific measures to prevent it.” (Source)


  • And sadly, my Twitter/𝕏 thread with the company in private message is going nowhere. 😿

    Sometimes “friendly” “reminding” a company about the relevant laws does wonders, making them return such “display” of “attentiveness” in a more timely manner. (Translation: they reply faster if you threaten them with the specific law.)

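    In this case the Ley Federal de Protección al Consumidor, article 17, second paragraph has got you covered:

    El consumidor podrá exigir directamente a proveedores específicos y a empresas que utilicen información sobre consumidores con fines mercadotécnicos o publicitarios, no ser molestado en su domicilio, lugar de trabajo, dirección electrónica o por cualquier otro medio, para ofrecerle bienes, productos o servicios, y que no le envíen publicidad. Asimismo, el consumidor podrá exigir en todo momento a proveedores y a empresas que utilicen información sobre consumidores con fines mercadotécnicos o publicitarios, que la información relativa a él mismo no sea cedida o transmitida a terceros, salvo que dicha cesión o transmisión sea determinada por una autoridad judicial

    (Translation: the consumer may directly demand from specific suppliers, and from companies that use consumer information for marketing or advertising purposes, not to be disturbed at home, at the workplace, at their electronic address or by any other means in order to be offered goods, products or services, and not to be sent advertising. Likewise, the consumer may demand at any time from suppliers and from companies that use consumer information for marketing or advertising purposes that information concerning them not be transferred or transmitted to third parties, unless such transfer or transmission is ordered by a judicial authority.)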


  • You // need // some // Xanax // /s 😁🍻

    NOOOOOOOOOOOOOOO STOP IT!!! /s

    Serious now. It doesn’t work here, since there’s no audible ping for every reply that you send me. It’s more like in WhatsApp*: I definitely don’t want to mute some people, but I wish that they didn’t send me multiple short messages.

    *inb4 I hate WhatsApp but not having it in Brazil is social suicide.


  • Messaging:

    • People who reply to direct text questions with 5min audio recordings.
    • People who use Enter as if it were the space bar, sending 10 messages for what could easily be sent as one.
    • People who treat their requests as being of utmost urgency, but take hours or even days to reply when you get back to them.

    Online forums:

    • The sort of illiterate fuck who treats “but” as if it contradicted everything preceding it.
    • People who feel entitled to have ELI5 versions of the text content produced by other people. (i.e. throwing a tantrum because of difficult words, text size, or even conceptual complexity.)
    • Usage of “lol” and/or “lmao”. (I mentally translate those into “I’m braindead and should be treated accordingly.”)
    • The sort of dead weight that focuses too much on specific words being used to convey something, instead of what it conveys.


  • For real. Companies being extra pushy with their product always makes me picture their decision makers saying:

    “What do you mean, «we’re being too pushy»? Those are customers! They are not human beings, nor do they deserve to be treated as such! This filth is stupid and un-human-like, it can’t even follow simple orders like «consume our product»! Here we don’t appeal to its reason, we smear advertisement on its snout until it needs to open its mouth to breathe, and then we shove the product down its throat!”

    Is this accurate? Probably not. But it does feel like this, especially when they’re trying to force a product with limited use cases down everyone’s throats, even after plenty of potential customers said “eeew no”. Such as machine text and image generation.