The definitive AI Glossary of Terms

The AI technology space is expanding rapidly and with that a whole new vocabulary needs to be learned. I have gathered many of those that I come across on a frequent basis (I actually maintain this as an internal resource in our company as well).

Generic AI Glossary of Terms
Semantic Kernel Glossary of Terms
Microsoft 365 Copilot Glossary of Terms
Advanced AI Glossary of Terms

Generic AI Glossary of terms

Agentic AI: Systems with autonomy and goal-directed behavior capable of making independent decisions, planning, and adapting to achieve specific objectives without direct, ongoing human input.

AI Ethics: The study of ethical considerations and the development of guidelines and principles related to the design, implementation, and use of artificial intelligence systems.

AI Fairness: The development and evaluation of AI systems to ensure that they treat all individuals and groups fairly, without discrimination or bias, and promote equal.

Application programming interface (API): Intermediary software components that allow two applications to talk to each other; a structured way for AI systems to programmatically access (usually external) models, data sets, or other pieces of software.

Artificial General Intelligence (AGI): An AI system that can perform any intellectual task that a human being can do, possessing a broad range of cognitive abilities and understanding.

Autonomous Systems: AI-powered machines or devices that can operate independently, without human intervention, to make decisions and perform tasks based on their programming and learned experience.

Azure OpenAI Service: The cloud service where the LLMs used for Microsoft 365 Copilot are hosted.

Bias: The presence of systematic errors in AI models, often due to biases in the training data, which may lead to unfair or discriminatory outcomes.

Big data: Large data sets that can be studied to reveal patterns and trends to support business decisions. It’s called “big” data because organizations can now gather massive amounts of complex data using data collection tools and systems.

Chatbot: A computer program designed to simulate conversation with human users, typically using text or voice-based interfaces.

Computer Vision: A field of artificial intelligence that teaches computers to interpret and understand visual information from the world, such as images, videos, and real-time camera feeds.

DALL-E: An AI model developed by OpenAI for generating images from textual descriptions.

Deep Learning: A subset of machine learning that involves training artificial neural networks to recognize patterns in data and make predictions or decisions.

Embeddings: Embeddings are a type of word representation that captures the semantic meaning of words based on their context in a high-dimensional space. In machine learning and natural language processing, embeddings transform discrete categorical variables (like words) into continuous vectors of fixed dimensions. These vectors capture semantic relationships between words, meaning that words with similar meanings tend to have vectors that are close to each other in this space.

Feature Engineering: The process of selecting, transforming, and creating relevant features or attributes from raw data to improve the performance of machine learning models.

Foundation models: Deep learning models trained on vast quantities of unstructured, unlabeled data that can be used for a wide range of tasks out of the box or adapted to specific tasks through fine-tuning. Examples of these models are DALL-E 2, GPT-4, PaLM, and Stable Diffusion.

GPT and GAI (Generative Artificial Intelligence): Advanced AI models and techniques used for generating content.

Hallucination or artificial hallucination: is a confident response by an AI that does not seem to be justified by its training data.

Jail break: Content restrictions have been placed on AI due to notorious incidents. Ask an AI to describe how to do something illegal or unethical, and they will refuse an answer. However, it's possible to "jailbreak" them – which means to bypass those safeguards using creative language, hypothetical scenarios, and trickery.

Knowledge graph: Also known as Semantic Networks. This helps machines understand how concepts are related.

Large Language Models (LLMs): Advanced AI models capable of understanding and generating human-like text based on the input they receive.

Machine Learning: A subset of artificial intelligence that involves training computer algorithms to learn patterns and relationships in data, allowing them to make predictions or decisions without being explicitly programmed to do so.

Modality: A high-level data category such as numbers, text, images, video, and audio.

Multimodal capabilities: The ability of an AI system to process and generate various types of data (text, images, audio, video) simultaneously, enabling complex tasks and rich outputs.

Natural Language Processing (NLP): A field of artificial intelligence that focuses on the interaction between computers and human language, enabling machines to understand, interpret, and generate human language.

Neural Network: A computing model inspired by the structure and function of the human brain, consisting of interconnected artificial neurons that process information and learn from data.

Prompts: User queries or questions that Microsoft 365 Copilot responds to using the sophisticated map.

Prompt Engineering: The process of designing and optimizing prompts to get desired outputs from AI models.

Reasoning AI: AI systems that perform logical thinking, step-by-step planning, problem solving, and decision making using structured or unstructured data, going beyond pattern recognition to draw conclusions and solve
complex problems.

Reinforcement Learning: A type of machine learning where an AI agent learns to make decisions by interacting with an environment and receiving feedback in the form of rewards or penalties.

Superagency: A state where individuals, empowered by AI, amplify their creativity, productivity, and positive impact. Even those
not directly engaging with AI can benefit from its broader effects on knowledge, efficiency, and innovation.

Supervised Learning: A machine learning task where an algorithm is trained on a labeled dataset, with input-output pairs, to learn the relationship between inputs and outputs.

Synthetic data is information that's artificially generated rather than produced by real-world events.

Transfer Learning: A technique in machine learning where an AI model, pre-trained on one task or dataset, is fine-tuned or adapted to perform a different but related task.

Unsupervised Learning: A machine learning task where an algorithm is trained on an unlabeled dataset, without input-output pairs, to discover patterns or structures in the data.

Vector: A mathematical representation that combines phrases, meanings, relationships, and context of data.

Vector Database: A database designed to store and manage vector data, which can be used in conjunction with embeddings.

Semantic Kernel Glossary of Terms

Semantic Kernel: An open-source SDK that allows the combination of AI services like OpenAI, Azure OpenAI, and Hugging Face with conventional programming languages such as C# and Python. It facilitates the creation of AI apps that merge the capabilities of AI and traditional programming.

AI Orchestration: The process of coordinating and managing various AI models and plugins to deliver enhanced user experiences.

Connectors: Tools provided by Semantic Kernel that enable developers to integrate AI services into their applications. They help in adding memories and models to apps.

Memory: In the context of Semantic Kernel, memory refers to the capability to retrieve information from past interactions or data. It can be simulated in AI apps using specialized plugins.

Plugins: Components that allow applications to interact with the real world. They consist of prompts and native functions that can respond to triggers and perform actions.

Embeddings: Representations of data in a format suitable for machine learning. In the context of Semantic Kernel, embeddings can be used to give apps "memories".

Planner: A tool within Semantic Kernel that can automatically create chains or sequences of actions to meet specific user needs. It does this by combining various plugins loaded into the kernel.

Semantic Functions: Represent the "ears" and "mouth" of the AI app, allowing it to listen to user requests and respond with natural language. These functions connect to the "brain" of the AI app through connectors.

Native Functions: Functions that allow the kernel to call C# or Python code directly. They can be used to manipulate data or perform operations that are not suitable for LLMs, such as calculations.

Microsoft 365 Copilot Glossary of Terms

Business Chat: A feature in Microsoft Teams that, with Copilot, can bring together information from multiple sources to update users on topics or events they might have missed.

Microsoft 365 Copilot: An AI-powered productivity tool that integrates with Microsoft Graph and Microsoft 365 Apps. It offers real-time intelligent assistance to enhance user creativity, productivity, and skills.

OpenAI Plugin Specification: A standard adopted by Microsoft for plugins to ensure interoperability across major AI apps and services like ChatGPT, Bing, and Microsoft 365.

Semantic Index for Copilot: A tool designed to help organizations prepare their data and users for Copilot by creating a sophisticated map of user and corporate data. Formed by encoding and indexing keyword searches into a vector that combines phrases, meanings, relationships, and context of the data. This map aids Microsoft 365 Copilot in understanding more about an organization.

Advanced AI Glossary of Terms

Vector Search: A method of information retrieval where documents and queries are represented as vectors instead of plain text. This allows for similarity search, multi-modal search, recommendation engines, and applications implementing the Retrieval Augmented Generation (RAG) architecture.

Azure Search: A cloud-based search-as-a-service solution provided by Microsoft Azure. It allows developers to integrate sophisticated search capabilities into applications and websites.

Azure Cognitive Search: An evolution of Azure Search, this service combines the core search capabilities with Azure Cognitive Services. It offers AI-powered content understanding, enabling richer search experiences by extracting insights from the data. Azure Cognitive Search - Vector Search it's in Private Preview, allowing you to do a vector search or hybrid search (vector and keyword retrieval).

Azure Cognitive Services: A collection of AI services and APIs offered by Microsoft Azure, covering domains like vision, speech, language, and decision-making.

Retrieval Augmented Generation (RAG): An architecture that combines the capabilities of retrieval-based and generative models for tasks like question answering.

k Nearest Neighbors (kNN): An algorithm used in vector search to find the most similar items to a given query in the vector space.

Vectorization: The process of converting data into vector form. In the context of this article, it refers to the transformation of content (like text or images) into vector representations.

AI Glossary of Terms

Generic AI Glossary of terms

Semantic Kernel Glossary of Terms

Microsoft 365 Copilot Glossary of Terms

Advanced AI Glossary of Terms

You may also be interested in...

Topics

AI Glossary of Terms

Generic AI Glossary of terms

Semantic Kernel Glossary of Terms

Microsoft 365 Copilot Glossary of Terms

Advanced AI Glossary of Terms

You may also be interested in...

Stay in touch!

Topics