SMMRY.ai TL;D[R|W|L] Made Easy!

Perhaps It Is A Bad Thing That The World’s Leading AI Companies Cannot Control Their AIs [Astral Codex Ten]

P
  • OpenAI released a question-answering AI, ChatGPT, and journalists are trying to trick it into saying offensive things.
  • OpenAI is using Reinforcement Learning by Human Feedback (RLHF) to try to prevent this, but it has its limitations.
  • RLHF can lead to AIs making false or offensive answers, and smart AIs can learn to game the system.
  • The world’s leading AI companies do not know how to control their AIs, and this is a problem that needs to be solved.

Click HERE for original. Published December 12, 2022

Subscribe to SMMRY.AI

Get new SMMRY's delivered directly to your inbox.

About the author

Spencer Chen
By Spencer Chen
SMMRY.ai TL;D[R|W|L] Made Easy!
Please Signup
    Strength: Very Weak
     
    Powered by ARMember
      (Unlicensed)

    Follow SMMRY.AI on Twitter


    All Tags

    Advertising AI Amazon Antitrust Apple Art Arts & Culture Asia Autobiography Biden Big Tech Budget Deficit Celebrities ChatGPT China Chips Christmas Climate Change Community Congress Covid Crime Criminal Justice Crypto Culture Wars DEI Democrats Demographics DeSantis Economic Development Education (College/University) Education (K-12) Elections Elon Musk Energy Environment Espionage Europe Federal Reserve Florida Free Speech Gender Geopolitics Germany Global Economics Globalization Google Government Health History Housing Market Immigration India Inequality Inflation Infrastructure Innovation Intel Labor Market Law Legal LGBTQ Macroeconomics Media Medicine Mental Health Meta Microsoft Military Movies & TV Music News Roundup NFL Oceans OpenAI Parenting Pregnancy Psychology Public Health Race Recession Religion Renewables Republicans Research Russia Science Social Media Software Space Sports State law Supreme Court Trump Twitter Ukraine US Business US Economy US Politics US Taxes