Llama 4: le nuove AI di Meta tra multimodalità, efficienza e neutralità nei contenuti

Llama 4: le nuove AI di Meta tra multimodalità, efficienza e neutralità nei contenuti
Der Einsatz von #GPT4 in der #Diagnostik zeigt Potenzial, doch aktuelle Studien belegen: Ohne gezielte #Schulung und klare Vorgaben bringt die Integration in den #Klinikalltag kaum Vorteile. Was ist nötig, um die Ergebnisse zu verbessern? Inwiefern kann die #KI Ärzt*innen unterstützen?
Suche Entwickler:in für ein KI-basiertes Analyse-Tool rund um Sprache & Wirkung. Kein KI-Gimmick, sondern ein Projekt mit Haltung – und echtem Nutzen für Medienmenschen & Leser:innen. Interesse? Schreib mir: kontakt@evalist.de
Quora’s Poe introduces a new $5/month premium tier and a $250/month plan for access to advanced models like GPT-4.5 and o1-pro. Perfect for users seeking cutting-edge AI tech! #AI #QuoraPoe #GPT4 #AIModels #TechNews #ArtificialIntelligence #MachineLearning #Innovation
AI
OpenAI Unveils Costly "o1-pro" Model
"o1-pro" promises improved reasoning but costs $150/million input tokens & $600/million output tokens.
Early tests show better reliability but struggles with logic puzzles.
Twice as costly as GPT-4.5 and 10x pricier than standard "o1".
How easy is to persuade a large language model?
The Interchange Forum for Reflecting on
Intelligent Systems (IRIS @Stuttgart_IRIS @Uni_Stuttgart) invites all interested students and university staff to their next IRIS colloquium on March 26 at 2 PM in the room UN 302.101, where Mara Seyfert will talk about uncertainty and robustness against persuasion in large language models.
We cordially invite all interested parties at the Universität Stuttgart to our IRIS Colloquium on March 26, at 2 p.m., in Room 101 at Universitätsstr. 32.
Mara Seyfert will give her lecture, „Uncertainty and robustness against persuasion in large language models.“
Today's large language models (LLMs) excel at providing convincing answers across a broad spectrum of inquiries, with their conversational capabilities enabling them to closely align with users' needs. However, this adaptability is beneficial only to the extent as models remain robust to adopting wrong statements from user inputs.
Recent research demonstrates that even advanced models like GPT-4 can shift from initially correct answers to incorrect ones during multi-turn conversations solely due to user input. In my talk, I will present my research exploring how uncertainty in LLMs can provide insights into their robustness against persuasion while highlighting the specific challenges of quantifying uncertainty in these models.
The lecture is held in English. Registration is not necessary.
#LargeLanguageModels #AI #GPT #GPT4 #robustnessagainstpersuasion #AIResearch #MachineLearning #LLMs #ArtificialIntelligence #TechTalk #RobustAI #DataScience #AIethics #Innovation
What Happened When #Conspiracy Theorists Talked to OpenAI's GPT-4 Turbo? - Slashdot
#ai #openai #gpt4 #gpt4turbo
AI
OpenAI’s GPT-4.5: High Cost, Low Impact?
30x pricier than GPT-4o yet delivers minimal improvements & struggles with coding.
Altman signals end of traditional LLMs, shifting focus to hybrid reasoning AI (o3).
Claude 3.7 Sonnet outperforms GPT-4.5, marking a competitive AI shake-up.
#AI #OpenAI #GPT4.5 #Claude3 #Tech
Unveiling GPT-4.5: A Leap Towards Emotionally Intelligent AI
OpenAI's latest model, GPT-4.5, marks a significant evolution in AI technology, emphasizing emotional intelligence and human alignment. With advancements in multimodal capabilities and a focus on ethi...
https://news.lavx.hu/article/unveiling-gpt-4-5-a-leap-towards-emotionally-intelligent-ai
Elon Musk’s AI Revolution Continues as xAI Unveils Grok 3 AI Model
#AIModels
#TechNews
#AICompetition
#FutureOfAI
#GPT4
#GoogleGemini
https://www.techi.com/elon-musks-xai-launches-grok-3-ai-model/
Unlocking Business Potential: How OpenAI's Operator Transforms Workflows
OpenAI's new AI agent, Operator, is revolutionizing the way small business owners automate tasks and streamline operations. With its advanced capabilities powered by GPT-4, this tool not only saves ti...
https://news.lavx.hu/article/unlocking-business-potential-how-openai-s-operator-transforms-workflows
#KINutzen
Der Einsatz von #GPT4 in der #Diagnostik zeigt Potenzial, doch aktuelle Studien belegen: Ohne gezielte #Schulung und klare Vorgaben bringt die Integration in den #Klinikalltag kaum Vorteile. Vor allem professionelles #PromptEngineering könnte die Ergebnisse verbessern. #KI kann Ärzt*innen unterstützen, ersetzt aber weder #Expertise noch #Verantwortung. Transparenz und ethische Standards sind essenziell.
"Separately, the authors also tested several contemporaneous large language models (GPT-4, GPT-3.5 and Llama 3 8B). GPT-4's edit summaries in particular were rated as significantly better than those provided by the human Wikipedia editors who originally made the edits in the sample – both using an automated scoring method based on semantic similarity, and in a quality ranking by human raters (where "to ensure high-quality results, instead of relying on the crowdsourcing platforms [like Mechanical Turk, frequently used in similar studies], we recruited 3 MSc students to perform the annotation").
This outcome joins some other recent research indicating that modern LLMs can match or even surpass the average Wikipedia editor in certain tasks (see e.g. our coverage: "'Wikicrow' AI less 'prone to reasoning errors (or hallucinations)' than human Wikipedia editors when writing gene articles").
A substantial part of the paper is devoted to showing that this particular task (generating good edit summaries) is both important and in need of improvements, motivating the use of AI to "overcome this problem and help editors write useful edit summaries":"
https://meta.wikimedia.org/wiki/Research:Newsletter/2025/January
Decoding Strategic Behavior in Large Language Models: A Game-Theoretic Analysis
Recent research delves into the strategic decision-making capabilities of leading large language models (LLMs) like GPT-3.5, GPT-4, and LLaMa-2 through the lens of game theory. By exploring their resp...
Moderne #KI-Modelle verblüffen mit ihrer Leistungsfähigkeit: Sie lösen komplexe Aufgaben, analysieren wissenschaftliche Texte und schreiben sogar Gedichte – sachlich präzise und sprachlich elegant. Doch ein neuer Test, "Humanity's Last Exam", zeigt die Grenzen dieser Technologie auf. Selbst Spitzenmodelle wie #GPT4 und Google #Gemini scheitern in vielen Bereichen. Interview mit Sören Möller zu dem Test, der selbst die besten KI-Modelle scheitern lässt.
https://www.fz-juelich.de/de/aktuelles/news/pressemitteilungen/2025/humanitys-last-exam-bringt-ki-an-ihre-grenzen
#KINutzen #Retröt
#KünstlicheIntelligenz kann effektiv #Verschwörungstheorien widerlegen. Durch gezielte Argumentation sank der Glaube an solche Theorien bei den Teilnehmenden um 20%. Die Chats hatten auch eine nachhaltige Wirkung auf die nächsten Monate. Die Ergebnisse zeigen, dass KI eine vielversprechende Unterstützung im Kampf gegen #Fehlinformationen sein könnte.
#KünstlicheIntelligenz #Verschwörungstheorien #Faktencheck #Studie #GPT4 #Science
GitHub Copilot Under Fire: A Deep Dive into AI Coding Performance
In a recent series of tests, GitHub Copilot, powered by OpenAI's GPT-4, showcased mixed results in coding tasks, raising questions about its reliability for developers. While it succeeded in some area...
https://news.lavx.hu/article/github-copilot-under-fire-a-deep-dive-into-ai-coding-performance