The SunnyData Blog
Explore insights and practical tips on mastering the Databricks Data Intelligence Platform and the full spectrum of today's modern data ecosystem.
AI
Cost allocation, cloud tags, and other relevant things on Databricks
In this three-part video series, Databricks MVP Josue Bogran and Greg Kroleski from Databricks’ “Money Team” discuss cost allocation, tagging best practices, cloud innovations, and performance enhancements. The series explores Databricks' efforts to optimize cost reporting, leverage AI for insights, and empower businesses to make smarter, growth-driven decisions in 2024 and beyond.
Seamless Data Integration: SAP to Databricks
Learn how to integrate SAP data into Databricks in this comprehensive blog post. Discover the essential components of the SAP ecosystem, including SAP HANA, S/4HANA, and BTP, and explore proven integration methods using Spark JDBC and Azure Data Factory. Perfect for data engineers and architects looking to combine SAP's enterprise management capabilities with Databricks' advanced analytics.
5 Reasons Why We Recommend Databricks
Discover why Databricks stands out as the leading data platform in this insightful blog. From unified data management to cost efficiency, unmatched performance, and robust analytics, Josue Bogran explains the top 5 reasons Databricks excels in the competitive landscape. Learn how Databricks balances innovation, user-centric design, and industry versatility to deliver exceptional results.
Performance, Benchmarks, and Optimization Tips for Databricks Users
Josue Bogran interviews Jeremy Lewallen from Databricks’ Performance Team, exploring benchmarks, storage cost optimization, rightsizing SQL Serverless Compute, and common compute mistakes. Discover why Databricks continually enhances performance, tips for using the latest Databricks Runtime (DBR) versions, and how the team's innovations provide a fast, efficient, and developer-friendly data platform.
Redshift to Databricks - Part 2: Technical Implementation Guide
This guide dives into the technical steps required to migrate from Amazon Redshift to Databricks. Covering everything from discovery and data evaluation to security protocols and cost estimation, it offers detailed, practical strategies for managing dependencies, optimizing queries, and planning for future scalability within Databricks’ robust ecosystem.
Are we in an AI Bubble?
There is a lot of buzz around GenAI, LLMs, AI, and ML. Knowing what to implement, and what to skip, is tough.
In this set of three videos, our Technical Advisor Josue Bogran has a frank and open conversation about these subjects with Denny Lee, Principal Developer Advocate at Databricks.
Databricks SQL in 5 Minutes
Databricks SQL is easy to get started with and highly performant. Don't just take our word for it: watch the video from our Technical Advisor Josue showing how simple it is to get started, along with his second video comparing Databricks and Snowflake query performance.
Redshift to Databricks - Part 1: Why and How to Start Your Migration
This blog introduces the strategic benefits and challenges of migrating from Amazon Redshift to Databricks. It covers Redshift’s legacy limitations, Databricks' advantages, and critical migration factors. The article provides an overview of key planning steps, including architecture considerations and phased migration strategies, setting the stage for technical execution in the upcoming part two.
Sigma + Databricks: A Great BI Tool for the Data Intelligence Platform
Sigma offers a user-friendly BI tool ideal for building and managing dashboards. With intuitive features, seamless embedding, and live data connections, it excels in usability. However, its steep pricing and lack of Git integration may deter small businesses. SunnyData recommends Sigma for startups and enterprises seeking responsive, customizable solutions.
How to Migrate Databricks from GCP to Azure or AWS
This blog explores the migration process of Databricks from one cloud provider (GCP) to another (Azure or AWS). It emphasizes using tools like Terraform for seamless migration, best practices for handling resources, data, and configurations, and discusses strategic reasons for switching cloud platforms.
Day 2 of Databricks vs Snowflake vs Fabric: Evaluating The Toolset
This post compares Databricks, Snowflake, and Fabric on key aspects such as toolsets, partner ecosystems, flexibility, ease of use, and overall business value. We break down each platform's strengths and weaknesses to guide you in choosing the best fit for your data strategy.
Why startups should consider Databricks as a top choice for their data platform for analytics, AI, and data management
Databricks is a top choice for startups seeking an all-in-one data platform for analytics, AI, and data management. Its cloud-native, scalable, and cost-efficient architecture allows startups to begin small, grow, and avoid complex migrations as needs evolve.
How to migrate your ETL workloads and EDW from Snowflake to Databricks
In this blog, we outline the essential steps for migrating ETL workloads and EDW from Snowflake to Databricks. From data migration to report modernization, we break down five key phases for a seamless and efficient transition to Databricks.
Why no one migrates from Databricks to Snowflake
This blog explores why companies are increasingly migrating from Snowflake to Databricks, highlighting Databricks’ integrated platform, cost-efficiency, and comprehensive data and AI capabilities. The post examines the pitfalls of Snowflake's pricing and its struggle to replicate Databricks' functionality.
Fabric Meets Databricks: A Technical Review
This blog compares Microsoft Fabric and Databricks, focusing on pricing, features, governance, and scalability. It concludes that while Fabric suits smaller businesses, Databricks excels for medium to large enterprises due to its flexibility, innovation, and advanced data capabilities.
Why Migrate to Databricks?
Migrating to Databricks offers companies unmatched benefits, including cost efficiency, scalability, and advanced AI/ML capabilities. With a unified platform and robust data governance, Databricks empowers businesses to modernize their data estates, ensuring they’re future-ready and positioned for growth.
Step Away From “The State Of The Art”
This blog argues against the rush to adopt the latest AI technologies, emphasizing the importance of strong data engineering foundations and "Classic Analytics." It advocates for a balanced, pragmatic approach to AI adoption, ensuring long-term success over short-term gains.
Databricks AI/BI Series: A Technical Overview of AI/BI Genie
Databricks AI/BI Genie leverages advanced AI models and Unity Catalog to answer business-specific queries with precision. This blog explores its potential to surpass traditional dashboards, offering a powerful tool for real-time, self-service analytics in a governed environment.
Fabric Meets Databricks: A Preliminary Review for Data Practitioners
This article provides an initial analysis of Microsoft Fabric, comparing it with Databricks. It discusses why Fabric may not meet enterprise needs and explores potential integrations between Fabric and Databricks, highlighting how they can complement each other in data projects.
Databricks AI Assistant: SQL Review
Databricks' AI Assistant is now GA and excels in SQL tasks, leveraging Unity Catalog metadata. It consistently delivers functional SQL queries and follows instructions well, proving itself superior to alternatives like ChatGPT for SQL code generation. Watch the video for insights.