Introducing GPT-4: OpenAI's Latest Language Model

Russell Kidson
Mar 16, 2023

OpenAI's latest AI model, GPT-4, has recently been launched and is already being implemented in various applications, including a virtual assistant for the visually impaired and an enhanced language learning bot on Duolingo. So, what distinguishes GPT-4 from previous models such as ChatGPT and GPT-3.5? Here are the five significant differences between these renowned systems.

ChatGPT is not GPT 3.5

Before delving into the differences, it's important to note that ChatGPT is not a version of OpenAI's large language model; instead, it is a chat-based interface that interacts with the model that powers it. While ChatGPT was initially identified as GPT-3.5, it is merely a means of communicating with GPT-3.5 and now with GPT-4.

Now that we have clarified this distinction, let's explore the distinctions between the beloved chatbot and its recently enhanced successor.

GPT-4 is multimodal in its approach

The latest improvement to this adaptable machine learning system is its ability to process 'multimodal' information, which means it can now understand multiple modes of data. In comparison, ChatGPT and GPT-3 were restricted to text-based interactions, being capable of reading and writing but little else - though this was sufficient for numerous use cases.

GPT-4's improved design enables it to comprehend and process relevant information within images, extending beyond simple image recognition tasks. For instance, OpenAI has showcased GPT-4's ability to explain the humor behind an image depicting an oversized iPhone connector. However, the partnership with Be My Eyes highlights a more significant potential of the model. The video demonstrates GPT-4's diverse capabilities by showcasing the utility performing various tasks. This indicates that GPT-4 has the potential to understand and interpret image data in a comprehensive and multifaceted manner.

GPT-4’s memory is more extensive

Large language models like GPT-4 are trained on vast amounts of text data, including web pages and books. However, during actual conversations with users, these models have limits to how much information they can keep in their attention function. The previous versions, GPT-3.5 and the original ChatGPT, were limited to 4,096 'tokens,' which roughly translates to 8,000 words or four to five pages of a book. Beyond this point, the model would start losing track of the conversation.

In contrast, GPT-4 has a token limit of 32,768, which is equivalent to around 64,000 words or 50 pages of text. This increase in token count allows the model to remember up to 50 pages of information during a conversation or text generation. For instance, it can recall a topic discussed 20 pages back in a conversation or refer to an event that occurred 35 pages ago in a written story or essay. While this is an approximate description of how the attention mechanism and token count work, the general idea is that GPT-4 has an expanded memory capacity, enabling it to handle more complex tasks.

It’s harder to lead GPT-4 astray

Despite their advancements, contemporary chatbots are prone to getting misled easily. With just a little bit of coaxing, they can be manipulated to say strange and sometimes unsettling things.  For instance, chatbots can be persuaded to elaborate on what a 'bad AI' would do, or they can collaborate on 'jailbreak' prompts to overcome their restrictions.

However, GPT-4 has undergone extensive training on malicious prompts, which users generously provided to OpenAI over the past couple of years. As a result, GPT-4 surpasses its predecessors in terms of factuality, steerability, and its ability to remain within established guidelines. According to OpenAI, GPT-3.5 (which powered ChatGPT) was a 'test run' of a new training architecture. The lessons learned from that experience were applied to the new model, resulting in a model that is 'unprecedentedly stable.' Additionally, OpenAI was able to predict the model's capabilities more accurately, resulting in fewer surprises.

Related: Visual ChatGPT - Temporary solution until GPT-4’s launch

GPT-4 processes more languages

The field of AI is primarily dominated by English-speaking individuals, and the majority of data, testing, and research papers are in English. However, the capabilities of large language models can be applied to any written language, making it imperative to make them available in other languages.

GPT-4 takes a significant step in this direction by showcasing its ability to accurately answer thousands of multiple-choice questions across 26 different languages, ranging from Italian and Ukrainian to Korean. While the model performs best on Romance and Germanic languages, it generalizes well to other languages as well.

While this preliminary testing of GPT-4's language capabilities is promising, it does not constitute a comprehensive adoption of multilingual abilities. The testing criteria were initially translated from English, and multiple-choice questions do not necessarily represent everyday language usage. However, the model performed remarkably well on this task, despite not having been explicitly trained for it. This suggests that GPT-4 could be significantly more accommodating to non-English speakers.

GPT-4 takes on different personalities

The concept of 'steerability' in artificial intelligence pertains to an AI system's ability to adapt its behavior according to specific requirements. This capability can be beneficial, as in assuming the role of an empathetic interlocutor, or potentially hazardous, such as when individuals manipulate the AI into exhibiting malevolent or despondent traits.

GPT-4 incorporates steerability more seamlessly than its predecessor, GPT-3.5, enabling users to customize the default ChatGPT personality, characterized by a consistent verbosity, tone, and style, to better align with their preferences. The development team emphasizes that this customization has its limits, acknowledging that pushing the model too far may result in breaking character.

Previously, users could indirectly achieve a degree of steerability by priming the chatbot with prompts like, 'Imagine you are a dungeon master in a tabletop RPG' or 'Respond as if you are a guest on a cable news interview.' However, this approach merely offered suggestions to the default GPT-3.5 personality. GPT-4 now empowers developers to incorporate a specific viewpoint, communication style, tone, or interaction mode from the outset.

GPT-4: AI for the next generation of tech

OpenAI's latest AI model, GPT-4, has introduced significant improvements and features that set it apart from its predecessors. With the ability to comprehend images, increased token counts, improved factuality, steerability, and multilingual capabilities, GPT-4 has the potential to revolutionize the field of AI. Additionally, the model's native steerability feature allows users to tailor the chatbot's conversational style and interaction methods to better suit their needs. While there is still room for improvement, GPT-4 is undoubtedly a remarkable achievement that will pave the way for even more advanced AI systems in the future.


Tutorials & Tips

Previous Post: «
Next Post: «


  1. bruh said on August 18, 2023 at 1:25 pm

    Uhh, this has already been possible – I am not sure how but remember my brother telling me about it. I’m not a whatsapp user so not sure of the specifics, but something about sending the image as a file and somehow bypassing the default compression settings that are applied to inbound photos.

    He has also used this to share movies to whatsapp groups, and files 1Gb+.

    Like I said, I never used whatsapp, but I know 100% this isn’t a “brand new feature”, my brother literally showed me him doing it, like… 5 months ago?

  2. 💥 said on August 18, 2023 at 3:55 pm

    Martin, what happened to those: 12 Comments ( Is there a specific justifiable reason why they were deleted?

    Hmm, it looks like the gHacks website database is faulty, and not populating threads with their relevant cosponsoring posts.

  3. 45 RPM said on August 19, 2023 at 6:29 pm

    The page on ghacks this is on represents the best of why it has become so worthless, fill of click-bait junk that it’s about to be deleted from my ‘daily reads’.

    It’s really like “Press Release as re-written by some d*ck for clicks…poorly.” And the subjects are laughable. Can’t wait for “How to search for files on Windows”.

    1. owl said on August 20, 2023 at 12:51 am

      > The page on ghacks this is on represents the best of why it has become so worthless, fill of click-bait junk…

      Sadly, I have to agree.

      Only Martin and Ashwin are worth subscribing to.
      Especially Emre Çitak and Shaun are the worst ones.

      If intended “Clickbait”, it would mark the end of Ghacks Technology News.
      Ghacks doesn’t need crappy clickbaits. Clearly separate articles from newer authors (perhaps AIs and external sales person or external advertising man) as just “Advertisements”!

      We, the subscribers of Ghacks, urge Martin to make a decision.

  4. chessandonions said on August 20, 2023 at 12:40 am

    because nevermore wants to “monetize” on every aspect of human life…

  5. Frank Rizzo said on August 20, 2023 at 11:52 pm

    “Threads” is like the Walmart of Social Media.

  6. Ashray said on August 21, 2023 at 4:06 pm

    How hard can it be to clone a twitter version of that as well? They’re slow.

  7. Paul(us) said on August 21, 2023 at 5:16 pm

    Yes, why not mention how large the HD files can be?
    Why, not mention what version of WhatsApp is needed?
    These omissions make the article feel so bare. If not complete.

    1. Paul(us) said on August 21, 2023 at 5:18 pm

      Sorry posted on the wrong page.

  8. Marc said on August 21, 2023 at 6:00 pm

    such a long article for such a simple matter. Worthless article ! waste of time

  9. plusminus_ said on August 21, 2023 at 7:54 pm

    I already do this by attaching them via the ‘Document’ option.

  10. John G. said on August 21, 2023 at 11:43 pm

    I don’t know what’s going on here at Ghacks but it’s obvious that something is broken, comments are being mixed whatever the article, I am unable to find some of my later posts neither. :S

  11. Tom Hawack said on August 23, 2023 at 2:28 pm

    Quoting the article,
    “As users gain popularity, the value of their tokens may increase, allowing investors to reap rewards.”

    Besides, beyond the thrill and privacy risks or not, the point is to know how you gain popularity, be it on social sites as everywhere in life. Is it by being authentic, by remaining faithful to ourselves or is it to have this particular skill which is to understand what a majority likes, just like politicians, those who’d deny to the maximum extent compatible with their ideological partnership, in order to grab as many of the voters they can?

    I see the very concept of this as unhealthy, propagating what is already an increasing flaw : the quest for fame. I won’t be the only one to count himself out, definitely.

    1. Tom Hawack said on August 23, 2023 at 2:34 pm

      @John G. is right : my comment was posted on [] and it appears there but as well here at []

      This has been lasting for several days. Fix it or at least provide some explanations if you don’t mind.

  12. Tom said on August 24, 2023 at 11:53 am

    > Google Chrome is following in Safari’s footsteps by introducing a new feature that allows users to move the Chrome address bar to the bottom of the screen, enhancing user accessibility and interaction.

    Firefox did this long before Safari.

  13. Mavoy said on September 16, 2023 at 2:17 pm

    Basically they’ll do anything except fair royalties.

Leave a Reply

Check the box to consent to your data being stored in line with the guidelines set out in our privacy policy

We love comments and welcome thoughtful and civilized discussion. Rudeness and personal attacks will not be tolerated. Please stay on-topic.
Please note that your comment may not appear immediately after you post it.