Artificial intelligence is worse than humans in every way at summarising documents and might actually create additional work for people, a government trial of the technology has found. Amazon conducted the test earlier this year for Australia’s corporate regulator the Securities and Investments Commission (ASIC) using submissions made to an inquiry. The outcome of the trial was revealed in an answer to a questions on notice at the Senate select committee on adopting artificial intelligence. The test involved testing generative AI models before selecting one to ingest five submissions from a parliamentary inquiry into audit and consultancy firms. The most promising model, Meta’s open source model Llama2-70B, was prompted to summarise the submissions with a focus on ASIC mentions, recommendations, references to more regulation, and to include the page references and context. Ten ASIC staff, of varying levels of seniority, were also given the same task with similar prompts. Then, a group of reviewers blindly assessed the summaries produced by both humans and AI for coherency, length, ASIC references, regulation references and for identifying recommendations. They were unaware that this exercise involved AI at all. These reviewers overwhelmingly found that the human summaries beat out their AI competitors on every criteria and on every submission, scoring an 81% on an internal rubric compared with the machine’s 47%. Human summaries ran up the score by significantly outperforming on identifying references to ASIC documents in the long document, a type of task that the report notes is a “notoriously hard task” for this type of AI. But humans still beat the technology across the board. Reviewers told the report’s authors that AI summaries often missed emphasis, nuance and context; included incorrect information or missed relevant information; and sometimes focused on auxiliary points or introduced irrelevant information. Three of the five reviewers said they guessed that they were reviewing AI content. The reviewers’ overall feedback was that they felt AI summaries may be counterproductive and create further work because of the need to fact-check and refer to original submissions which communicated the message better and more concisely.
3 September 2024
many thanks for inviting me to do one :)) !!
@caustic-caffeine and I did a collab!! :)
gave my pinned post an overhaul so it has the ✨fancy✨ without being too over the top. plus added more organizational tags for funsies 😁
figured i’d add to this! here is a process reel going into how i generally do things, and an unlisted youtube video with the speedpaint and additional thoughts is in the works! (because i’m a masochist like that…/nsrs)
obviously i’m still working to improve and this isn’t a tutorial per say; it’s just explaining how i do things, since a few people wanted to know and it seemed fun to talk about lmao
(also, this is fran’s mermay ref, not canonical in the slightest but i figured i’d make one for funsies yk?)
[francis, ayelet (ayelet’s @boxxed-upkr ‘s)]
and so it begins. random tumblr posts pinterest put in my feed but as my ocs.
(if any of you have recs please tag me under the posts teehee)
peepeepoopoo first set of artfight refs are done
These bios leave so much out I want to cry, but I think it gives an idea of what’s going on
Specifically what’s being left out is all the worldbuildy info on how their cultures and religioms shaped their worldviews and how they clash. As well as EVERYTHING to do with Sol’s marriage. The whole concept of the ashborn mothergod birthing perfect children and that it’s the responsibility of each child to stay that way. The sherit gods and the insane demands of their doctrine
Look the point is I’ve put a lot of thoughts into these characters and their world and I am going hogwild rn
I am Amira from Gaza🍉. The war destroyed everything I own. I lost my father, my home, my job, and my university. I can't bear any more. Could you please donate or share my story to help protect us 🙏?
i wish you the best of luck with everything!