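Reading guide summary
1. Keywords: PyCharm, Databricks, Integration, Data Analytics, Python
2. Summary: PyCharm now integrates with Databricks. The integration makes it easier to process, store, and analyze data, combines the strengths of both platforms, requires no extra configuration, and is aimed at organizations and data professionals who work with Databricks. The article also covers the integration's benefits and how to get it.
3. Outline:
- Introduction to the PyCharm Databricks integration
  - Purpose: make data processing, storage, and analysis easier
- Benefits of the integration
  - Build data and AI applications on Databricks from within PyCharm Professional
  - Write code quickly and run it in the cloud without extra configuration
  - Combine the strongest capabilities of each platform to support application development at scale
- Features
  - Connect to a Databricks cluster and monitor it
  - Run Python scripts on a remote cluster
  - Run Jupyter notebooks or Python scripts as workflows
  - Synchronize project files to the Databricks workspace
- How to get started
  - Install the plugin from JetBrains Marketplace or from within the PyCharm IDE
  - Follow the documentation for step-by-step instructions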
Article URL: https://blog.jetbrains.com/pycharm/2024/08/introducing-the-pycharm-databricks-integration/
Source: blog.jetbrains.com
Author: Stanislav Garkusha
Published: 2024/8/19 8:47
Language: English
Word count: 659
Estimated reading time: 3 minutes
Rating: 89
Tags: PyCharm, Databricks, data platform, Python IDE, big data
The original article follows.
Introducing the PyCharm Databricks Integration
We’re introducing the Databricks integration with PyCharm Professional to make it easier for you to process, store, and analyze your data!
The integration allows you to build your data and AI apps on the Databricks Data Intelligence Platform directly within PyCharm Professional, enhancing the data analytics platform with the powerful Python IDE by JetBrains. It enables you to write code quickly and easily and run it in the cloud without extra configurations, and it offers additional benefits for working with data.
Read this blog post to learn more about the integration, who it will be useful for, and what benefits it offers.
Watch the plugin in action
[Embedded video]
What is Databricks?
The Databricks Data Intelligence Platform allows your entire organization to use data and AI. It’s built on a lakehouse to provide an open, unified foundation for all data and governance, and is powered by a Data Intelligence Engine that understands the uniqueness of your data.
What is PyCharm Professional?
PyCharm Professional is a leading IDE for Python and other programming languages. It allows you to write high-quality and efficient code using superior code completion, refactoring capabilities, code inspections, seamless code and project navigation, a debugger, and a wide range of integrations, including Jupyter notebooks, testing frameworks, Git, CI/CD solutions, and more – all available in one place right out of the box.
Who will the integration be useful for?
Organizations and data professionals using data lakehouses, data lakes, and data warehouses via Databricks will benefit from this integration.
What benefits does the integration bring?
The integration combines the most powerful capabilities of each platform, allowing you to easily build all of your data and AI applications at scale within PyCharm:
- Use PyCharm to implement software development best practices, which are essential for large codebases, such as source code control, modular code layouts, testing, and more.
- Databricks enables the use of powerful clusters, allowing you to work on projects too large for a local machine and helping you orchestrate data processing efficiently.
You can write the code for your pipelines and jobs in PyCharm, then deploy, test, and run it in real time on your Databricks cluster without any additional configuration.
Let’s dive into more details about what the PyCharm Databricks integration provides.
Connect to your cluster via PyCharm
You can connect directly to the Databricks cluster via PyCharm and monitor the process within the IDE. This allows you to check if the cluster is running, see the results of the current session’s runs, and view process outcomes along with additional details.
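
For readers curious what "checking if the cluster is running" looks like outside the IDE, here is a minimal sketch using the public Databricks REST API (`GET /api/2.0/clusters/get`). It does not describe how the plugin works internally, which the post does not explain. The host, token, and cluster ID in the usage note are placeholder values, and the helper names are hypothetical.

```python
import json
import urllib.parse
import urllib.request


def build_cluster_status_request(host: str, token: str, cluster_id: str) -> urllib.request.Request:
    """Build (but do not send) a request for a cluster's current state.

    Assumption: authentication uses a personal access token sent as a
    Bearer token; other authentication methods are not covered here.
    """
    query = urllib.parse.urlencode({"cluster_id": cluster_id})
    url = f"{host.rstrip('/')}/api/2.0/clusters/get?{query}"
    return urllib.request.Request(
        url,
        headers={"Authorization": f"Bearer {token}"},
        method="GET",
    )


def fetch_cluster_info(request: urllib.request.Request) -> dict:
    """Send the request and decode the JSON response (needs network access)."""
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def is_running(cluster_info: dict) -> bool:
    """Return True when the response reports the cluster state as RUNNING."""
    return cluster_info.get("state") == "RUNNING"
```

With real credentials, `is_running(fetch_cluster_info(build_cluster_status_request(host, token, cluster_id)))` would report whether the cluster is up. The plugin surfaces the same kind of information in the IDE without any of this code.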

Run Python scripts on a remote cluster
You can run Python scripts on a remote cluster and view the results in the IDE, which is particularly useful when working with big data.

Run Jupyter notebooks or Python scripts as workflows
You can also run your notebooks or Python scripts as a Databricks workflow and see the output in the console.

You can see the results of the runs on the Databricks platform, including the runs initiated from PyCharm.
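
To make the idea of "running a script or notebook as a workflow" more concrete, the sketch below builds the kind of JSON payload that the Databricks Jobs API accepts for a one-time run (`POST /api/2.1/jobs/runs/submit`). This is an illustration only, not a description of what the plugin sends. The function name, the `is_notebook` flag, the task key, and the default run name are all hypothetical.

```python
import json


def build_one_time_run(source_path: str, cluster_id: str,
                       is_notebook: bool, run_name: str = "pycharm-run") -> dict:
    """Build a payload for a one-time run on an existing cluster.

    Assumption: the caller states whether source_path points at a notebook
    or a Python file; this sketch does not try to detect it.
    """
    if is_notebook:
        task_body = {"notebook_task": {"notebook_path": source_path}}
    else:
        task_body = {"spark_python_task": {"python_file": source_path}}
    task = {
        "task_key": "main",                 # hypothetical task name
        "existing_cluster_id": cluster_id,  # reuse a cluster that already exists
        **task_body,
    }
    return {"run_name": run_name, "tasks": [task]}


if __name__ == "__main__":
    # Placeholder path and cluster ID, for illustration only.
    payload = build_one_time_run("/Users/someone@example.com/report", "0123-abc", is_notebook=True)
    print(json.dumps(payload, indent=2))
```

Submitting the payload would additionally need an authenticated HTTP call. In the IDE, the plugin's run action replaces all of this.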

Synchronize project files to the Databricks workspace
The synchronization of project files with the Databricks workspace allows you to access and work with the same files in both PyCharm and Databricks workspaces. You can also schedule your notebooks and scripts and utilize other platform features for projects completed in PyCharm.
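
As a rough picture of what pushing one project file to the workspace involves, the following sketch prepares a payload for the Databricks Workspace API import endpoint (`POST /api/2.0/workspace/import`), which expects base64-encoded content. The plugin's actual synchronization mechanism is not described in the post, so treat this purely as an assumption-laden illustration. The helper name and the workspace path used in the example are hypothetical.

```python
import base64


def build_workspace_import(source_text: str, workspace_path: str) -> dict:
    """Build an import payload for a single file.

    Assumptions: the file is UTF-8 text, the "AUTO" format lets the
    platform infer the object type from the file extension and content,
    and an existing object at the same path should be overwritten.
    """
    encoded = base64.b64encode(source_text.encode("utf-8")).decode("ascii")
    return {
        "path": workspace_path,
        "format": "AUTO",
        "content": encoded,
        "overwrite": True,
    }


if __name__ == "__main__":
    # Placeholder workspace path, for illustration only.
    payload = build_workspace_import("print('hello')\n", "/Users/someone@example.com/project/main.py")
    print(payload["path"], len(payload["content"]))
```

A full synchronization would repeat this for every changed file and handle deletions. The plugin takes care of that bookkeeping for you.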

How to get started
Make sure you have PyCharm Professional and access to a Databricks workspace ready to go.
You can install the Databricks plugin either from JetBrains Marketplace or directly from within the PyCharm IDE.
Head over to the documentation to get step-by-step instructions on how to get started and use the plugin.
What do you think about this integration? Share your thoughts in the comments below.