Apache Superset is a web-based data browsing and visualization application. Superset on EMR supports MySQL, Hive, Presto, Impala, Kylin, Druid, and ClickHouse.
Enter http://${master_ip}:18088
in your browser (or go to the EMR console > Cluster Service) to open the login page of Supserset. The default username is admin
, and the password is the one you set when creating the cluster.
Go to Sources > Databases and click Filter List.
On the following page, add the URI of the component to be added in SQLAlchemy URI.
The SQLAlchemy URI for each database is as follows:
Name | SQLAlchemy URI | Remarks |
---|---|---|
MySQL | mysql+pymysql://<mysqlname>:<password>@<mysql_ip>:<mysql_port>/<your_database> |
mysqlname : Username used to connect to MySQL.password : MySQL password.your_database : The MySQL database to be connected to. |
| Hive | `hive://hadoop@<master_ip>:7001/default?auth=NONE` | `master_ip`: Master IP of the EMR cluster. |
| Presto | presto://hive@<master_ip>:9000/hive/<hive_db_name>
|
master_ip
of the EMR clusterdefault
by default |impala://<core_ip>:27000
| core_ip
: core IP of EMR cluster. |kylin://<kylin_user>:<password>@<master_ip>:16500/<kylin_project>
| master_ip
of the EMR clusterclickhouse://<user_name>:<password>@<clickhouse-server-endpoint>:8123/<database_name>
| clickhouse://default:password@localhost:8123/default
Superset supports databases. To install another database, follow the steps below:
source /usr/local/service/superset/bin/activate
command.
Was this page helpful?