Lucas Ferguson
—Feb 14, 2025
by Lucas Ferguson, CS + Cybersecurity Student @ Illinois Institute of Technology
My AI Adventures and the Latest Innovations
As 2025 gets underway, the world of artificial intelligence is evolving at a breakneck pace. This year has already been packed with exciting developments, from groundbreaking AI tools to major tech announcements. While the industry is buzzing with innovation, I’ve also been diving deep into my own AI experiments at home, exploring how these tools can enhance productivity and creativity. Here’s a look at what’s new in AI this month and how I’ve been putting it all to use.
Here is a quick list of the big news so far this year!
Corporate AI Race
Dev Tool Updates
Government & Policy Updates
Initiative | Impact | Timeline |
US Executive Order 14179 | Replaces the previous AI safety order (EO 14110, revoked 1/20/2025) with an innovation-first policy | Signed 1/23/2025 |
UNESCO Ethics Framework | Adopted by 38 nations for AI governance | Ratified 2/1/2025 |
Colorado AI Act | Mandates algorithmic audits for high-risk systems | Enforcement begins Q3 2025 |
Ethics Corner
In the following sections, I cover three of the topics above in more detail.
China's DeepSeek continues to challenge proprietary AI dominance with its latest open-source marvel, achieving unprecedented performance-to-cost ratios while maintaining commercial usability. The V3 architecture demonstrates how focused engineering can overcome hardware limitations:
Specification | 2024 (DeepSeek-V2) | 2025 (DeepSeek-V3) | Improvement |
Total Parameters | 236B | 671B | 184% ↑ |
Training Data | 5.8T tokens | 14.8T tokens | 155% ↑ |
MMLU Benchmark | 82.3% | 88.5% | 7.5% ↑ |
Inference Speed | 20 tokens/sec | 60 tokens/sec | 3x ↑ |
Training Efficiency | 8.1M GPU-hours | 2.788M GPU-hours | 65% ↓ |
API Cost per 1M tokens | $0.50 | $0.18 | 64% ↓ |
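The Improvement column mixes relative changes ("155% ↑") with multipliers ("3x ↑"), so it can help to re-derive the figures in one consistent form. The following is a minimal sketch that takes the table's numbers as given (they are not independently verified here) and computes the relative change for several rows; the `pct_change` helper and the `ROWS` list are illustrative names, not part of any DeepSeek tooling.

```python
def pct_change(old: float, new: float) -> float:
    """Relative change from old to new, as a percentage of old."""
    return (new - old) / old * 100


# (label, V2 value, V3 value), copied from the table above.
ROWS = [
    ("Training data (T tokens)", 5.8, 14.8),
    ("MMLU (%)", 82.3, 88.5),
    ("Inference speed (tokens/sec)", 20, 60),
    ("Training compute (M GPU-hours)", 8.1, 2.788),
    ("API cost ($ per 1M tokens)", 0.50, 0.18),
]

for label, old, new in ROWS:
    # A leading + or - makes increases and decreases easy to tell apart.
    print(f"{label}: {pct_change(old, new):+.1f}%")

# Benchmark scores are already percentages, so the absolute gain in
# percentage points is worth reporting next to the relative change.
print(f"MMLU gain: {88.5 - 82.3:.1f} percentage points")
```

Two things stand out when the numbers are computed this way: a "3x" speedup is a 200% increase, and the MMLU gain is 6.2 percentage points, which corresponds to the 7.5% relative improvement listed in the table.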
The V3 model introduces three breakthrough innovations:
DeepSeek's open-source strategy now pressures major players. Developers praise the MIT license's commercial flexibility, especially us here at Open Code Development!
Here's a comprehensive breakdown of DeepSeek model parameters across versions, incorporating both base and distilled models:
1. DeepSeek-R1 Series (deepseek-ai/DeepSeek-R1 on Hugging Face)
Model | Total Params | Activated Params | Context Length | Architecture | Download |
DeepSeek-R1-Zero | 671B | 37B | 128K | Mixture of Experts | 🤗 HuggingFace |
DeepSeek-R1 | 671B | 37B | 128K | Mixture of Experts | 🤗 HuggingFace |
Model | Total Params | Base Model | Architecture | Download |
DeepSeek-R1-Distill-Qwen-1.5B | 1.5B | Qwen2.5-Math-1.5B | Modified transformer with enhanced math reasoning | 🤗 HuggingFace |
DeepSeek-R1-Distill-Qwen-7B | 7B | Qwen2.5-Math-7B | Dense transformer with multi-head attention | 🤗 HuggingFace |
DeepSeek-R1-Distill-Llama-8B | 8B | Llama-3.1-8B | Rotary positional embeddings | 🤗 HuggingFace |
DeepSeek-R1-Distill-Qwen-14B | 14B | Qwen2.5-14B | Multi-query attention mechanism | 🤗 HuggingFace |
DeepSeek-R1-Distill-Qwen-32B | 32B | Qwen2.5-32B | Hybrid local/global attention patterns | 🤗 HuggingFace |
DeepSeek-R1-Distill-Llama-70B | 70B | Llama-3.3-70B-Instruct | Grouped-query attention (GQA) | 🤗 HuggingFace |
2. DeepSeek-V2 Series (deepseek-ai/DeepSeek-V2 on Hugging Face)
Model Variant | Total Parameters | Activated/Task Parameters | Architecture |
V2 | 236B | 21B per token | Mixture of Experts |
V2.5 | 238B | 16B per token | Mixture of Experts |
Coder-V2-Base | 16B | 2.4B per token | Mixture of Experts |
Coder-V2 | 236B | 21B per token | Mixture of Experts |
3. DeepSeek-V3 Series (deepseek-ai/DeepSeek-V3 on Hugging Face)
Model Variant | Total Parameters | Activated/Task Parameters | Architecture |
V3 | 671B | 37B per token | Mixture of Experts |
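V3-Base | 685B* | 37B per token | Mixture of Experts + Multi-Token Prediction |
* The 685B figure is the Hugging Face checkpoint size: 671B of main-model weights plus 14B of Multi-Token Prediction (MTP) module weights.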
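The gap between total and activated parameters is what makes Mixture of Experts models cheaper to run than their size suggests: only a small share of the weights is used for each token. As a rough illustration, the sketch below computes that share from the figures in the tables above; the `active_share` helper and the `MODELS` dictionary are illustrative names, and the numbers are simply the ones listed here.

```python
def active_share(total_b: float, active_b: float) -> float:
    """Percentage of a model's parameters activated per token."""
    return active_b / total_b * 100


# model variant -> (total parameters, activated parameters), in billions,
# taken from the V2 and V3 tables above.
MODELS = {
    "V2": (236, 21),
    "Coder-V2-Base": (16, 2.4),
    "V3": (671, 37),
}

for name, (total_b, active_b) in MODELS.items():
    share = active_share(total_b, active_b)
    print(f"{name}: {active_b}B of {total_b}B active per token ({share:.1f}%)")
```

By this measure V3 activates a smaller share of its weights per token than V2 does, even though its activated parameter count is higher in absolute terms.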
General Hardware Requirements for running these with Ollama:
Samsung’s Galaxy Unpacked event this past January was one of the highlights of the month for me. The new Galaxy S25 series unveiled some incredible AI-powered features that cater specifically to power users like myself. As someone who relies heavily on Google Calendar for organization, my favorite feature is the Gemini assistant’s ability to research a topic and then create multiple calendar events or tasks from what it finds.
For example, you can ask Gemini to add all your favorite sports team’s games to your calendar automatically! This kind of natural language integration takes productivity to a whole new level and feels like a glimpse into the future of digital assistants.
Another standout aspect of the Galaxy S25 series is Samsung’s focus on local processing for many AI features. Features like natural language search in the Gallery app, suggested routines, and even audio erasing run directly on the device without needing cloud connectivity. This emphasis on privacy and speed is a refreshing change in an era where many companies rely heavily on cloud-based solutions.
For power users who need more advanced capabilities, Samsung has also introduced tools like "AI Select," which lets you summarize articles or generate creative content without leaving your current app. Combined with local processing capabilities, these features make the Galaxy S25 series one of the most innovative smartphones on the market today.
Here’s a breakdown of which Galaxy S25 features run locally versus those that require cloud connectivity:
Feature | Runs Locally | Requires Cloud |
Natural Language Search (Gallery) | Yes | No |
Suggested Routines | Yes | No |
Now Brief | Yes | No |
AI Select | Yes | No |
Audio Eraser | Yes | No |
Writing Assistance (Auto Format) | Partial | Partial |
Drawing Assist | No | Yes |
Sticker Generation | No | Yes |
Gemini Live | No | Yes |
Cross-App Action | No | Yes |
One of my favorite recent projects has been running DeepSeek’s AI models on my home lab server. It is powered by a six-year-old Intel i5 CPU, 20 GB of RAM, and a solid-state drive with plenty of storage space. Despite these limited resources, I’ve been amazed at how well it handles DeepSeek’s 1.5 billion parameter model. The performance is decent, but the output is only somewhat accurate, which makes the model better suited to creative tasks like brainstorming ideas, planning projects, or game development than to work where correctness matters.
I also tested DeepSeek’s larger 7 billion parameter model. While the quality of its responses is noticeably better, offering richer and more nuanced insights, it runs slower than I’d prefer on my current hardware. Still, the fact that I can run such advanced models locally is a testament to how far AI optimization has come.
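A rough back-of-the-envelope check helps explain why both models fit in 20 GB of RAM. The sketch below estimates weight memory as parameter count times bits per weight. The quantization levels (4, 8, and 16 bits) and the 20% runtime overhead factor are assumptions chosen for illustration; they are not measurements from this server or figures published by DeepSeek.

```python
def weight_memory_gb(params_billions: float, bits_per_weight: int) -> float:
    """Approximate memory for the model weights alone, in GB (1 GB = 1e9 bytes)."""
    total_bytes = params_billions * 1e9 * bits_per_weight / 8
    return total_bytes / 1e9


def fits_in_ram(params_billions: float, bits_per_weight: int,
                ram_gb: float, overhead: float = 1.2) -> bool:
    """True if weights plus an assumed runtime overhead fit in the given RAM.

    The overhead factor is a guess that stands in for the context cache and
    runtime buffers; real usage depends on the runtime and context length.
    """
    return weight_memory_gb(params_billions, bits_per_weight) * overhead <= ram_gb


RAM_GB = 20  # the home lab server described above

for label, size_b in [("1.5B", 1.5), ("7B", 7.0)]:
    for bits in (4, 8, 16):
        need = weight_memory_gb(size_b, bits)
        verdict = "fits" if fits_in_ram(size_b, bits, RAM_GB) else "does not fit"
        print(f"{label} at {bits}-bit: ~{need:.2f} GB of weights, {verdict} in {RAM_GB} GB")
```

Under these assumptions even the 7B model fits on paper, so the slowdown is more likely tied to how much weight data the CPU has to read for every generated token than to running out of memory.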
Another exciting project I’ve been working on involves enhancing my note-taking workflow using AI. I use Obsidian to organize my academic notes across topics like operating systems, machine learning, augmented reality, and Flutter development. To take this system to the next level, I’ve developed a process that embeds all my notes into a vector database hosted on the same server.
Using Qdrant, a powerful open-source vector database, I can now perform semantic searches across my notes. For example, if I’m studying augmented reality frameworks, I can query the database for related notes on machine learning techniques that enhance AR experiences. This setup has made it incredibly easy to uncover connections between concepts and retrieve relevant information instantly.
Qdrant GitHub: https://github.com/qdrant/qdrant (21.8k Stars as of 2025-02-14)
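To make the idea of semantic search concrete, here is a minimal, dependency-free sketch of the ranking step a vector database performs: embed the query, compare it to every stored vector by cosine similarity, and return the closest notes. It does not use Qdrant's API. The `embed` function is a toy word-count stand-in for a real embedding model, and the note titles and contents are hypothetical.

```python
import math
from collections import Counter


def embed(text: str) -> Counter:
    """Toy stand-in for an embedding model: lowercase word counts."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def search(query: str, notes: dict, limit: int = 2) -> list:
    """Return the `limit` note titles most similar to the query, with scores."""
    q = embed(query)
    scored = [(cosine(q, embed(body)), title) for title, body in notes.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [(title, round(score, 3)) for score, title in scored[:limit]]


# Hypothetical notes standing in for an Obsidian vault.
NOTES = {
    "AR frameworks": "augmented reality frameworks track camera pose and anchor virtual objects",
    "ML for AR": "machine learning models estimate depth and pose for augmented reality scenes",
    "OS scheduling": "operating systems schedule processes with priority queues",
    "Flutter widgets": "flutter builds interfaces from composable widgets",
}

for title, score in search("machine learning for augmented reality", NOTES):
    print(title, score)
```

A real embedding model maps text to dense vectors that capture meaning beyond shared words, which is what lets a query about AR surface a machine learning note that never mentions AR by name. The ranking logic stays the same; a vector database adds persistence and fast approximate search over many more vectors.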
The combination of DeepSeek’s local processing capabilities and Qdrant’s efficient search functionality has completely transformed how I manage knowledge. It’s not just about storing information anymore; it’s about making it actionable and accessible in ways that save time and spark new ideas.
Dify.AI GitHub: https://github.com/langgenius/dify (67.1k Stars as of 2025-02-14)
This month, I also explored Dify.AI, an open-source platform designed to make building generative AI applications easier than ever. Dify allows users to create complex workflows without writing code by visually connecting components in its Orchestration Studio. Whether you’re building simple agents or crafting intricate pipelines for Retrieval-Augmented Generation (RAG), Dify provides an intuitive interface that lowers the barrier to entry for AI development. With its recent updates focusing on production readiness and scalability, Dify is quickly becoming a favorite among developers and non-developers alike.
This month has reaffirmed how transformative AI can be, not just at an industry level but also in personal workflows. From running advanced models like DeepSeek in my home lab to integrating Qdrant into my note-taking system and exploring tools like Dify, it’s clear that AI is becoming more accessible than ever. Samsung’s focus on empowering power users with locally processed features, while leveraging cloud capabilities for more demanding tasks, shows how some companies are listening to user needs for privacy. As someone who thrives with productivity tools like Google Calendar, ClickUp, and Obsidian, I find this glimpse of a future where these tools can talk to each other incredibly inspiring. As we move further into 2025, I’m excited to see how these technologies continue to evolve!
Thanks for reading my article!
If you want to learn more about me and the projects I'm working on, be sure to check out my website at lucasferguson.net.
https://www.reuters.com/technology/meta-invest-up-65-bln-capital-expenditure-this-year-2025-01-24/
https://news.microsoft.com/en-cee/2025/01/08/6-ai-trends-youll-see-more-of-in-2025
https://www.nytimes.com/2025/01/21/technology/trump-openai-stargate-artificial-intelligence.html
https://www.cnn.com/2025/01/21/tech/openai-oracle-softbank-trump-ai-investment/index.html
https://www.anthropic.com/news/contextual-retrieval
https://devblogs.microsoft.com/visualstudio/announcing-a-free-github-copilot-for-visual-studio
https://www.cursor.com/changelog/-cursor-rules-better-codebase-understanding-new-tab-model
https://theinfluenceagency.com/blog/figma-new-features-2025
https://www.unesco.org/en/artificial-intelligence/recommendation-ethics
https://www.nytimes.com/2024/06/10/technology/california-ai-regulation.html
https://www.infoq.com/news/2025/01/deepseek-v3-llm/
https://hackaday.com/2025/01/27/new-open-source-deepseek-v3-language-model-making-waves/
https://lasvegassun.com/news/2024/jun/13/states-take-up-ai-regulation-amid-federal-standsti/
https://github.com/Lightricks/LTX-Video
https://huggingface.co/deepseek-ai/DeepSeek-R1