Learning AI Using ChatGPT

Alessandro Federici Apr 4, 2023 4 min read

In this post, I will share some of what I learned about Artificial Intelligence and ChatGPT during the past couple of weeks. I'll discuss Natural Language Processors, token similarity, and present examples that show how to complete a statement and how to translate a sentence from French to English. Finally, I will show a preview of my first chatbot written in C#.

Natural Language Processors

ChatGPT is a Natural Language Processor (NLP): a type of Artificial Intelligence that helps computers understand and interpret human language. NLPs are commonly used to create chatbots, allowing them to generate human-like responses to user input.

To do all this, the NLP first breaks down the user's text into individual words or phrases, known as "tokens" as in the following diagram (source).

These tokens are then converted into mathematical representations called "vectors," which allow the AI model to analyze and compare them for similarity. The following diagram shows how this looks (source).

https://developers.google.com/machine-learning/guides/text-classification/step-3 — Token similarity examples

💡

While tokens and words are often equivalent in NLP, there are cases where a single word may be split into multiple tokens or where tokenization may be more complex (e.g. languages without spaces between words).

Cosine Similarity

Cosine similarity is a measure used in NLP to compare the similarity between two vectors, with a value between -1 and 1 indicating the degree of similarity.

ChatGPT uses this measure to generate contextually relevant and grammatically correct predictions for the user's prompt by comparing it to a large database of text and using machine learning to improve accuracy (diagram source).

How is the response generated?

To generate a complete response from a vector representation of the similarity between tokens, ChatGPT uses a combination of natural language processing and machine learning techniques.

Natural Language Processing Techniques

Tokenization: breaking down the text into individual words or phrases (tokens) for analysis.
Part-of-Speech Tagging: identifying the part of speech of each token in a sentence.
Named Entity Recognition: identifying and extracting named entities (such as people, places, and organizations) from the text.
Sentiment Analysis: determining the sentiment or emotion conveyed in a piece of text.

Machine Learning Techniques

Recurrent Neural Networks (RNNs): a type of neural network that can process sequential data, such as text, by maintaining a memory of previous inputs.
Transformers: a type of neural network that uses self-attention mechanisms to process and analyze text, resulting in more accurate predictions.
Gradient Boosting: a type of ensemble learning algorithm that combines multiple weak models to make more accurate predictions.
Support Vector Machines (SVMs): a type of machine learning algorithm used for classification, that separates data into different classes using a hyperplane in high-dimensional space.

Overall, the combination of natural language processing and machine learning techniques enables NLP models to analyze and generate text in a more intelligent and human-like way. By leveraging both types of algorithms, NLP models can understand and interpret the meaning and context of a text, and generate responses that are more accurate and contextually relevant.

How do I write a basic chatbot?

Most of what was discussed so far is hidden from the developer thanks to the APIs of OpenAI, AzureAI, etc. but understanding how tokens work is important to write better code and manage costs.

When we write code to access NLPs we think in terms of "creating completions", "translating", and so on. The following examples should give an idea of the type of APIs we will be accessing.

Text Generation

This example generates a text completion for the prompt "Once upon a time", using the OpenAI API.

The max_tokens parameter controls the maximum length of the generated text, while the temperature parameter controls the "creativity" of the text (higher values result in more unpredictable text).

The answer would be similar to this:

This first example is the foundation of more complex examples like the ChatGPT client itself, which can be implemented as a loop that keeps doing what this example does, while at the same time "remembering" all the things that have been said since the beginning of the chat.

💡

At the end of this article will show a preview of a chat client I wrote using C#.

Language Translation

This example uses the OpenAI API to translate the text "Bonjour, comment ça va?" from French to English.

The model parameter specifies the translation model to use, while the source and target parameters specify the source and target languages for the translation. The resulting translation is outputted to the console.

The response will be "Hello how are you".

Next steps

In the next post I will show how I created my first ChatBot in C#, using what I learnt so far and the help of Microsoft Semantic Kernel.

For more information on Microsoft Semantic Kernel go here:

What is Semantic Kernel?

Overview of Semantic Kernel

Microsoft Learnjohnmaeda

Learning AI Using ChatGPT

Natural Language Processors

Cosine Similarity

How is the response generated?

Natural Language Processing Techniques

Machine Learning Techniques

How do I write a basic chatbot?

Text Generation

Language Translation

Next steps

Alessandro Federici

How to Quickly Adjust Visual Studio Fonts

We're a Network - Alan Watts

Featured posts

What is Solarpunk?

The Tao Te Ching on law and government

God Is Change: The Radical Philosophy of Earthseed

Tags

Latest posts

What is Solarpunk?

The Process

The Tao Te Ching on law and government

Follow us

What is Solarpunk?

The Process

The Tao Te Ching on law and government

Natural Language Processors

Cosine Similarity

How is the response generated?

Natural Language Processing Techniques

Machine Learning Techniques

How do I write a basic chatbot?

Text Generation

Language Translation

Next steps

Featured posts

Tags

Latest posts

Newsletter

Follow us