Reputation: 716
Apache Ambari moved into the Attic in January 2022.
So Apache Ambari has retired, and the only reliable alternative that I know is Cloudera Manager, but Cloudera Manager is a paid service and because of that is not very helpful for small and medium companies.
What tools can now help us install and manage in a proper way the Hadoop ecosystem to go live in production? We'd prefer not to end up with a Hadoop ecosystem installed manually and hardly manageable.
Are there some good alternatives to Apache Ambari?
Upvotes: 4
Views: 4460
Reputation: 76
Apache Ambari leaves the attic status and return to the top level apache project
Upvotes: 6
Reputation: 192033
The question falls into several categories...
For self-hosted solutions, the answer is configuration management + automation tools.
Ansible (+ AWX/Tower), Puppet (+ Foreman, or similar), Chef, etc for file-management (config-as-code, GitOps style). This is far better than Ambari because the config file templates (and history) are actually backed up in VCS rather than spread across several HTML input boxes like in Ambari (or Cloudera Manager).
Use virtualization like VMWare or otherwise acquire physical machines in your data-center.
Otherwise, all cloud-providers have their own dashboards for cluster management and provisioning and elastic scaling. For a "small / medium company", you should focus on your business problem, not infrastructure maintenance, so use the cloud. I've personally used the EMR terraform module, and it was fairly straightforward for a basic cluster (non production, didn't need to maintain it very long).
None of the cloud Hadoop offerings use Ambari.
You previously had apache-spark, and for that, you could use just use Databricks and not need a whole Hadoop cluster.
Grafana.
Prometheus JMX Exporter can be added to all the Hadoop JVM processes. Node and Blackbox exporter can also be added for CPU/Mem host usage and TCP/HTTP healthchecks.
Recent HDP releases had already started using Grafana for displaying metrics and a lot of them duplicated the Ambari widgets.
Prometheus AlertManager, for example, but you may want something more robust like NewRelic, DataDog, etc.
HUE is probably the closest thing to File Browser, Hive Editor, etc.
Upvotes: 2
Reputation: 1659
You should check out:
Apache Bigtop - Bigtop is an Apache Foundation project for Infrastructure Engineers and Data Scientists looking for comprehensive packaging, testing, and configuration of the leading open source big data components. Bigtop supports a wide range of components/projects, including, but not limited to, Hadoop, HBase and Spark.
Upvotes: 0