paint-brush
Take the Guesswork Out of Installing DolphinScheduler—Here’s How to Do Itby@williamguo

Take the Guesswork Out of Installing DolphinScheduler—Here’s How to Do It

by William Guo
William Guo HackerNoon profile picture

William Guo

@williamguo

William Guo, WhaleOps CEO, Apache Software Foundation Member

February 13th, 2025
Read on Terminal Reader
Read this story in a terminal
Print this story
Read this story w/o Javascript
Read this story w/o Javascript
tldt arrow

Too Long; Didn't Read

Ambari is a tool for managing Hadoop clusters. Although Ambari does not directly support the installation of DolphinScheduler, we can integrate it into Ambari through a custom service. This article will describe how to use Ambari to install DolphinScheduler.
featured image - Take the Guesswork Out of Installing DolphinScheduler—Here’s How to Do It
1x
Read by Dr. One voice-avatar

Listen to this story

William Guo HackerNoon profile picture
William Guo

William Guo

@williamguo

William Guo, WhaleOps CEO, Apache Software Foundation Member

About @williamguo
LEARN MORE ABOUT @WILLIAMGUO'S
EXPERTISE AND PLACE ON THE INTERNET.
0-item
1-item

STORY’S CREDIBILITY

Code License

Code License

The code in this story is for educational purposes. The readers are solely responsible for whatever they build with it.

Guide

Guide

Walkthroughs, tutorials, guides, and tips. This story will teach you how to do something new or how to do something better.

Apache DolphinScheduler is a distributed, decentralized, and scalable visual DAG workflow task scheduling systeåm. It is committed to solving the complex dependencies in data processing workflows, making the scheduling system ready-to-use out of the box. Ambari is a tool for managing Hadoop clusters. Although Ambari does not directly support the installation of DolphinScheduler, we can integrate it into Ambari through a custom service. This article will describe how to use Ambari to install DolphinScheduler.

1. Ambari Service Installation Package

Place the service installation package that needs to be integrated into Ambari in the following directory:

/var/lib/ambari-server/resources/stacks/HDP/3.1/services

2. Restart the Ambari-server Service

sudo ambari-server restart

After the integration, you can proceed with the service installation on the Ambari web page.

  1. Add Service and Select DolphinScheduler
    image
  2. DolphinScheduler Service Configuration Choose to use MySQL as the metadata database for the DolphinScheduler service.
    image
  3. Service Selection for Installation Select two DS-master nodes, at least two DS-ALERTS, DS-APIs, and deploy the remaining nodes as DS-worker nodes.
    image
  4. Error Handling for hue-desktop-ini If you have configured namenode-ha before installing DolphinScheduler, you may encounter errors related to 'hue-desktop-ini'. You need to modify the params.py script.
Script location: /var/lib/ambari-server/resources/stacks/HDP/3.3/services/DOLPHINSCHEDULER/package/scripts/params.py

Modify the content:
# hdfs_httpfs_host = config['configurations']['hue-desktop-ini']['hdfs_httpfs_host']
hdfs_httpfs_host = "dfs_ha_enabled"  # Comment out the original line and assign any value to hdfs_httpfs_host


image
After making the changes, save and restart the ambari-server service to reinstall DolphinScheduler.

  1. Copy MySQL Driver to All Nodes' libs Directory
cp /tmp/mysql-connector-j-8.0.33.jar /usr/hdp/current/dolphinscheduler/alert-server/libs/
cp /tmp/mysql-connector-j-8.0.33.jar /usr/hdp/current/dolphinscheduler/api-server/libs/
cp /tmp/mysql-connector-j-8.0.33.jar /usr/hdp/current/dolphinscheduler/master-server/libs/
cp /tmp/mysql-connector-j-8.0.33.jar /usr/hdp/current/dolphinscheduler/worker-server/libs/
cp /tmp/mysql-connector-j-8.0.33.jar /usr/hdp/3.3.1.0-004/dolphinscheduler/tools/libs/
  1. Change Ownership of All Services to hdfs User
chown -R hdfs:hdfs /var/log/dolphinscheduler
chown -R hdfs:hdfs /usr/hdp/3.3.1.0-004/dolphinscheduler
chown -R hdfs:hdfs /etc/dolphinscheduler/
ln -s /usr/hdp/3.1.4.0-315/dolphinscheduler dolphinscheduler

After making these changes, start DolphinScheduler from the Ambari management platform. The metadata will be automatically initialized and tables will be created during the startup process.

L O A D I N G
. . . comments & more!

About Author

William Guo HackerNoon profile picture
William Guo@williamguo
William Guo, WhaleOps CEO, Apache Software Foundation Member

TOPICS

THIS ARTICLE WAS FEATURED IN...

Arweave
Read on Terminal Reader
Read this story in a terminal
 Terminal
Read this story w/o Javascript
Read this story w/o Javascript
 Lite
Hackernoon
Threads
Bsky
X REMOVE AD