• Lior's View
  • Posts
  • ⚔️ OpenAI Dev Day vs Elon's New LLM

⚔️ OpenAI Dev Day vs Elon's New LLM

On Elon Musk's new model, OpenAI dev day, Runway's hyperrealistic update, Cohere EmbedV3, and Biden's AI mandate.

AlphaSignal

Hey ,

Welcome to this week's edition of AlphaSignal.

Whether you are a researcher, engineer, developer, or data scientist, our summaries ensure you're always up-to-date with the latest breakthroughs in AI.

Let's get into it!

Lior

In Today’s Email:

  • Top Releases and Announcements in the Industry

  • OpenAI Dev Day will be live streamed today

  • Elon and 𝕏AI team just released Grok, a powerful competitor

  • Phind beats ChatGPT while being 5x faster

Read Time: 4 min 29 sec

RELEASES & ANNOUNCEMENTS

OpenAI Dev Day, Their First-Ever Developer Conference, Will Be Streamed LIVE Today at 10AM PST
The OpenAI DevDay is starting today and the excitement across the entire Silicon Valley is similar to the early days of iPhone announcements. People are eager to hear about the latest developments in ChatGPT, including the possibility of custom bots and more affordable options.

Stability AI Unveils New Tools for Image Editing, Fine Tuning, and 3D Asset Generation
Stability AI announces enterprise-grade APIs, a Sky Replacer tool, Stable 3D for quick 3D model generation, and Stable FineTuning for image customization, with added transparency features like Content Credentials and watermarking for AI-generated images.

Runway Updates Gen-2 AI Video Model Stunning the Industry
Runway has updated its Gen-2 AI video model, significantly enhancing video fidelity and consistency, and introduced high-definition motion capabilities for AI-generated or still image-based videos, impacting the future of AI filmmaking.

Luma AI Launches a New text-to-3D Tool, Now Available for Free
Luma AI's Genie, a new tool for creating 3D models from text, is now available on Discord, enabling quick generation and download of customizable objects in GLB format, with plans for future enhancements.

Cohere Releases Embedv3, Their Most Advanced Text Embedding Model
Embed v3 introduces optimized text embeddings with SOTA MTEB/BEIR performance, robust noise tolerance, and storage-efficient compression, enhancing retrieval tasks and multilingual support.

Must Watch: OpenAI + Scale on How To Fine-TuneGPT 3.5 For Your Business

Have you considered using OpenAI's GPT-3.5 for your company, but weren’t sure where to start?

Join OpenAI and Scale on November 8th at 10 AM PT where you’ll learn:

  • When and how to fine-tune GPT-3.5

  • How to optimize your company data for fine-tuning GPT-3.5

  • Most importantly: How to avoid the biggest mistakes other enterprises made when fine-tuning GPT-3.5

This is a great opportunity to dive deep into what fine-tuning GPT-3.5 can do for enterprises, while learning from real-world use cases along the way. By the end of the session, you'll know how to get started with GPT-3.5 in your own organization.

You don’t want to miss it.

NEWS
Elon Musk’s 𝕏AI Unveils Grok-1, a Powerful GPT and Claude Competitor

What's New?
xAI, the company led by Elon Musk, has launched Grok, their first AI assistant powered by a new frontier large language model, Grok-1. Competing with OpenAI's ChatGPT, it excels in reasoning and coding, featuring a sophisticated set of capabilities that includes humor and proactive questioning.

Why Does It Matter?
Grok-1 is designed to serve as a research assistant. It exhibits exceptional reasoning, achieving strong performance on industry benchmarks. The model’s fundamental advantage is its real-time connectivity to global data via the 𝕏 platform. As such, it can provide users with the most relevant, up-to-date information.

Key Takeaways:

  1. Enhanced Reasoning: Grok-1's outperforms GPT-3.5 across the board, and surpasses Claude 2 on certain math benchmarks.

  2. Real-Time Data: Grok will have near real-time access to tweets from 𝕏, allowing it to provide the most current information.

  3. Focus on Robustness: xAI designed a custom training and inference stack to maximize model uptime.

  4. Early Beta Access: Grok is currently in its initial beta phase, inviting user feedback.

NEWS
Executive Order on AI: Biden Sets New Standards for Safety and Equity

What's New?
President Biden has issued an Executive Order aimed at setting new standards for AI safety, security, and equity. It mandates companies developing high-impact AI to report safety test results to the U.S. government. Additionally, it outlines measures for AI-enabled fraud detection, privacy, and civil rights protections. The National Institute of Standards and Technology is tasked with developing rigorous testing standards for AI systems.

Why Does It Matter?
The Order's emphasis on AI safety and security standards marks a significant shift towards structured oversight in AI development, especially for models with national security implications. It draws a clear line for federal engagement in AI regulation, potentially affecting large-scale AI developers. However, it's crucial for stakeholders, including smaller players, as it could shape future compliance landscapes and influence open-source AI developments.

Main Takeaways

  • Enhanced AI Safety: Implementation of comprehensive testing and safety measures for AI systems.

  • Privacy Protection: Development of privacy-preserving technologies and federal privacy guidelines.

  • Equity and Civil Rights: Guidance to prevent AI algorithms from perpetuating discrimination.

  • Consumer Safeguard: Standards for responsible AI use in healthcare, education, and consumer goods.

  • Workforce Support: Principles to address AI's impact on jobs and labor standards.

NEWS
Phind’s New Model Beats GPT-4 While Coding 5 Times Faster

What's New?
Phind has unveiled its 7th-generation model, outperforming GPT-4 in coding proficiency with the speed of GPT-3.5 and an impressive 16k token context.

The new Phind model is built on top of their open-source CodeLlama-34B fine-tune, and was fine-tuned with over 70 billion tokens. It scores 74.7% on the HumanEval benchmark.

Why Does It Matter?
Phind’s model is the first to surpass GPT-4 on the HumanEval benchmark, and Phind model V7 far exceeds other open source coding models. User feedback highlights that Phind's model often provides better or equal help compared to GPT-4 for real coding problems, becoming the preferred choice within its community. What’s more, the model is 5x faster than GPT-4.

Main Takeaways

  • Coding Excellence: Phind Model beats GPT-4 on the HumanEval benchmark and often provides more helpful responses on real-world coding problems.

  • Superior Speed: The model achieves a 5x speed increase over GPT-4 by using NVIDIA's TensorRT-LLM library, reaching 100 tokens per second.

  • Expanded Context: The model supports up to 16k tokens of context, allowing inputs of up to 12k tokens, and reserving 4k tokens for results of web searches.

Read Last Week’s Summaries:

How was today’s email?

Not Great      Good      Amazing

Thank You

Igor Tica is a contributing writer at AlphaSignal and a research engineer at SmartCat, focusing on computer vision. He's actively seeking partnerships in self-supervised and contrastive learning.

Jacob Marks is an editor at AlphaSignal and ML engineer at Voxel51, is recognized as a leading AI voice on Medium and LinkedIn. Formerly at Google X and Samsung, he holds a Ph.D. in Theoretical Physics from Stanford.

Want to promote your company, product, job, or event to 150,000+ AI researchers and engineers? You can reach out here.