Function Calling — Topic Summaries
AI-powered summaries of 25 videos about Function Calling.
25 summaries
OpenAI DevDay: Opening Keynote
OpenAI’s DevDay keynote centers on a major shift from “chat” toward practical, agent-like AI—powered by a new GPT-4 Turbo model, new multimodal...
Introducing gpt-realtime in the API
OpenAI is rolling out a new GPT-realtime speech model and an upgraded real-time API in general availability, aiming to make voice interactions with...
OpenAI GPT-4 Function Calling: Unlimited Potential
Function calling turns GPT-4 from a chatty text generator into a tool that can reliably output structured, machine-ready inputs for real code—cutting...
OpenAI DevDay 2024 | Multimodal apps with the Realtime API
OpenAI’s Realtime API is built to deliver natural, low-latency “speech-in, speech-out” experiences through a single interface—removing the multi-step...
Anthropic's Meta Prompt: A Must-try!
Anthropic’s “Metaprompt” tool turns weak, one-off prompts into a structured, model-ready instruction set—by using Claude itself to generate the final...
OpenAI DevDay 2024 | Structured outputs for reliable applications
Structured outputs are OpenAI’s push to make LLM results dependable for real applications by forcing model outputs to match developer-supplied JSON...
OpenAI Realtime API - The NEW ERA of Speech to Speech? - TESTED
OpenAI’s Realtime API can deliver genuinely interactive “speech-to-speech” style experiences—built by wiring a persistent WebSocket—while also...
Llama 3 8B: BIG Step for Local AI Agents! - Full Tutorial (Build Your Own Tools)
A local Llama 3 8B agent can be made genuinely useful by giving it a small set of “tools” (Google search, RAG-based retrieval, and email sending) and...
Model Context Protocol - The Why | MCP Trilogy | CampusX
Model Context Protocol (MCP) is positioned as the missing layer that lets AI assistants work across many tools without the usual copy‑paste “context...
HUGE Open AI Announcements: GPT-4 Turbo, GPTs in ChatGPT, Assistants API, new modalities
OpenAI’s Dev Day announcements put a clear emphasis on scaling what GPT-4 can do—faster, cheaper, and with far longer context—then packaging those...
DeepSeek R1 for Structured Agents
DeepSeek’s R1 reasoning model can’t natively produce the structured, tool-friendly outputs that most agent frameworks rely on—no function calling, no...
Building a LangGraph ReAct Mini Agent
A simple LangGraph pattern—one “reasoner” node plus a single prebuilt “tools” node—can replace sprawling agent graphs full of separate nodes for each...
Unlock Open Multimodality with Phi-4
Microsoft’s Phi-4 family just got more practical for local, multimodal work: the Phi-4 3.8B “mini instruct” lineup now includes function calling and...
Mistral Small 3 - The NEW Mini Model Killer
Mistral has released “Mistral Small 3,” a new 24B-parameter open-weight model positioned as a fast, capable “workhorse” for everyday tasks—aimed at...
GPT-4 Turbo with Google Web Browsing (Assistants API)
A practical Assistants API pattern is on display: rewrite a user question into a Google-friendly search query, fetch fresh web results, scrape the...
Mistral Large with Function Calling - Review and Code
Mistral Large positions itself as a strong alternative to top closed models by pairing solid reasoning performance with native function calling—while...
Gemini 2.0 - Video Analyzer with Code
Gemini’s “Video Analyzer” turns uploaded videos into structured, time-coded outputs—captions, spoken transcripts, visual scene descriptions, key...
AI AGENTS Updates From Google, OpenAI and Anthropic
AI agents are increasingly defined less by raw language ability and more by their ability to pursue goals through a loop of tool use—an approach...
FunctionGemma - Function Calling at the Edge
Function Gemma brings customizable function calling to a compact Gemma model designed for edge deployment—so apps and games can run locally on phones...
The AI Ops Engineer - Next BIG Role in Tech? 🤖
A new “AI Ops Engineer”–style role is taking shape around turning rapidly evolving foundation models into working, shipped products—without requiring...
Could This Change The Way We Use Computers FOREVER? - OpenAI Realtime API Function Calling
A voice-driven “function calling” agent built on OpenAI’s Realtime API can take direct actions on a user’s computer—opening websites, navigating...
Stuck in the Chatbox? Here's When You Actually Need the API
The core message: chatbot access is a deliberately limited “demo,” while the API unlocks more control, better cost transparency, and workflow-level...
Realtime Voice AI AGENTS Will Explode in 2025 | SHOWCASE
Real-time voice AI agents are moving from demos to practical business workflows—using function calling to check availability, confirm bookings, and...
LLM Function Calling (Tool Use) with Llama 3 | Tool Choice, Argument Mapping, Groq Llama 3 Tool Use
Function calling with Llama 3 is no longer a niche capability: a Groq-tuned “Llama 3 tool use” model can reliably translate natural-language requests...
MemGPT - Unlimited Context Window (Memory) for LLMs | Paper review, Installation & Demo
MemGPT targets a core bottleneck in today’s large language models: limited context windows that force earlier parts of a conversation or large...