Cosmin
Cosmin

Reputation: 716

Apache Ambari retired. What next?

Apache Ambari moved into the Attic in January 2022.

So Apache Ambari has retired, and the only reliable alternative that I know is Cloudera Manager, but Cloudera Manager is a paid service and because of that is not very helpful for small and medium companies.

What tools can now help us install and manage in a proper way the Hadoop ecosystem to go live in production? We'd prefer not to end up with a Hadoop ecosystem installed manually and hardly manageable.

Are there some good alternatives to Apache Ambari?

Upvotes: 4

Views: 4460

Answers (3)

david zollo
david zollo

Reputation: 76

Apache Ambari leaves the attic status and return to the top level apache project

screenshot from mailing list

Upvotes: 6

OneCricketeer
OneCricketeer

Reputation: 192033

The question falls into several categories...

Configuration

For self-hosted solutions, the answer is configuration management + automation tools.

Ansible (+ AWX/Tower), Puppet (+ Foreman, or similar), Chef, etc for file-management (config-as-code, GitOps style). This is far better than Ambari because the config file templates (and history) are actually backed up in VCS rather than spread across several HTML input boxes like in Ambari (or Cloudera Manager).

Use virtualization like VMWare or otherwise acquire physical machines in your data-center.

Otherwise, all cloud-providers have their own dashboards for cluster management and provisioning and elastic scaling. For a "small / medium company", you should focus on your business problem, not infrastructure maintenance, so use the cloud. I've personally used the EMR terraform module, and it was fairly straightforward for a basic cluster (non production, didn't need to maintain it very long).
None of the cloud Hadoop offerings use Ambari.

You previously had , and for that, you could use just use Databricks and not need a whole Hadoop cluster.

Monitoring Widgets

Grafana.

Prometheus JMX Exporter can be added to all the Hadoop JVM processes. Node and Blackbox exporter can also be added for CPU/Mem host usage and TCP/HTTP healthchecks.

Recent HDP releases had already started using Grafana for displaying metrics and a lot of them duplicated the Ambari widgets.

Ambari Alerts

Prometheus AlertManager, for example, but you may want something more robust like NewRelic, DataDog, etc.

UI query functions (Ambari Views)

HUE is probably the closest thing to File Browser, Hive Editor, etc.

Upvotes: 2

steven-matison
steven-matison

Reputation: 1659

You should check out:

https://bigtop.apache.org/

Apache Bigtop - Bigtop is an Apache Foundation project for Infrastructure Engineers and Data Scientists looking for comprehensive packaging, testing, and configuration of the leading open source big data components. Bigtop supports a wide range of components/projects, including, but not limited to, Hadoop, HBase and Spark.

Upvotes: 0

Related Questions