These instructions apply to both single-node and multi-node deployments. They are based on the article https://community.hortonworks.com/articles/60805/deploying-a-fresh-metron-cluster-using-ambari-serv.html and describe how to install HDP 2.5 with Metron.

If anything goes wrong, consult that article or the Troubleshooting section at the end of this page. The article has its own troubleshooting section as well; make sure to check it first.

IMPORTANT: Some of the values in this article, such as IP addresses and hostnames, must be replaced with your own. These values are enclosed in angle brackets <like this>. Make sure to replace them before running a command.

Prerequisites:

  • CentOS 6

  • To simplify the install, make sure that your main network interface is named eth0. If it is not, you will need to adjust the configuration of some Ambari services accordingly (e.g. ElasticSearch).

  • Single-node: At least 48 GB RAM, 8 cores and 400 GB HDD. Multi-node: At least 32 GB RAM, 4 cores and 200 GB HDD for smooth performance.

  • (optional) Disable PackageKit if it is installed; otherwise skip this step:

    sed -i 's/enabled=1/enabled=0/g' /etc/yum/pluginconf.d/refresh-packagekit.conf
  • Increase limits for ElasticSearch and Storm on nodes where you will be installing them (if you don't know, increase it everywhere):

    echo -e "elasticsearch - memlock unlimited\nstorm - nproc 257597" >> /etc/security/limits.conf
  • Disable IPv6. Leaving it enabled may cause a service to bind to IPv6 addresses only, leaving you unable to connect to it:

    sysctl -w net.ipv6.conf.all.disable_ipv6=1
    sysctl -w net.ipv6.conf.default.disable_ipv6=1
    echo -e "\n# Disable IPv6\nnet.ipv6.conf.all.disable_ipv6 = 1\nnet.ipv6.conf.default.disable_ipv6 = 1" >> /etc/sysctl.conf
  • Disable Transparent Huge Pages (THP). Add "transparent_hugepage=never" to the end of the kernel line in grub.conf and reboot (Ambari asks for this; it is not clear whether it is strictly required):

    transparent_hugepage=never

    After the reboot, check that the change was applied (the word "never" should be the one enclosed in square brackets):

    # cat /sys/kernel/mm/transparent_hugepage/enabled
    always madvise [never]
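
The hardware, interface and THP prerequisites above can be checked with a short script before continuing. The following is a minimal sketch and not part of the original procedure: the function names (meets_minimum, thp_active_value, preflight) are made up for illustration, the thresholds are passed in as arguments, and the THP status path is the one used in the check above. Reported RAM is usually slightly below the nominal size, so a WARN on the RAM line is a prompt to look closer, not a hard failure.

```shell
#!/bin/sh
# Preflight sketch (hypothetical helper names, not part of Metron or Ambari).

# meets_minimum ACTUAL REQUIRED: succeeds when ACTUAL >= REQUIRED (integers).
meets_minimum() {
    [ "$1" -ge "$2" ]
}

# thp_active_value LINE: print the bracketed (active) value of a THP status
# line such as "always madvise [never]".
thp_active_value() {
    printf '%s\n' "$1" | sed -n 's/.*\[\([a-z]*\)\].*/\1/p'
}

# preflight MIN_RAM_GB MIN_CORES: print one OK/WARN line per check.
preflight() {
    ram_gb=$(awk '/^MemTotal:/ {print int($2 / 1048576 + 0.5)}' /proc/meminfo)
    cores=$(nproc)
    if meets_minimum "$ram_gb" "$1"; then echo "OK   RAM ${ram_gb} GB"; else echo "WARN RAM ${ram_gb} GB (< $1)"; fi
    if meets_minimum "$cores" "$2"; then echo "OK   ${cores} cores"; else echo "WARN ${cores} cores (< $2)"; fi
    if [ -e /sys/class/net/eth0 ]; then echo "OK   eth0 present"; else echo "WARN eth0 not found"; fi
    thp_file=/sys/kernel/mm/transparent_hugepage/enabled
    if [ -r "$thp_file" ] && [ "$(thp_active_value "$(cat "$thp_file")")" = "never" ]; then
        echo "OK   THP disabled"
    else
        echo "WARN THP not disabled (or status file missing)"
    fi
}

# Example, using the multi-node minimums from the list above:
# preflight 32 4
```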


Metron install pre-preparation

On all nodes, install the prerequisites for Ambari:

yum install epel-release -y
yum update -y
yum install git wget curl rpm openssh-clients tar unzip bzip2 createrepo yum-utils ntp python-pip -y


On the main node, where Ambari will be located, install the following:

 

pip install --upgrade pip
pip install --upgrade setuptools

 

Install Java 1.8 on the main node, where Maven will be installed below, and on the Metron node (if you don't know which node that is, install it everywhere):

 

yum install java-1.8.0-openjdk java-1.8.0-openjdk-devel -y

 

Set JAVA_HOME to the Java 8 installation if it is not already set:

export JAVA_HOME=$(readlink -f /usr/bin/java | sed "s_/jre/bin/java__")

Save export for future reboots:

echo 'export JAVA_HOME=$(readlink -f /usr/bin/java | sed "s_/jre/bin/java__")' > /etc/profile.d/java_18.sh
chmod +x /etc/profile.d/java_18.sh
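
To see what the JAVA_HOME expression does, the sketch below applies the same sed substitution to a sample path. The function names and the sample paths are hypothetical and only serve as illustration. Note that when the resolved binary does not end in /jre/bin/java, the path is left unchanged, so it is worth checking that $JAVA_HOME/bin/java actually exists.

```shell
# java_home_from_binary PATH: strip the trailing /jre/bin/java, as the sed
# expression above does, to obtain the JDK directory.
java_home_from_binary() {
    printf '%s\n' "$1" | sed "s_/jre/bin/java__"
}

# looks_like_java_home DIR: succeeds when DIR contains an executable bin/java.
looks_like_java_home() {
    [ -x "$1/bin/java" ]
}

# Hypothetical resolved path of /usr/bin/java, for illustration only.
sample=/usr/lib/jvm/java-1.8.0-openjdk.x86_64/jre/bin/java
java_home_from_binary "$sample"
```

After logging in again, looks_like_java_home "$JAVA_HOME" gives a quick sanity check of the saved export.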


Download and install Maven:

wget http://apache.volia.net/maven/maven-3/3.3.9/binaries/apache-maven-3.3.9-bin.tar.gz
tar -zxf apache-maven-3.3.9-bin.tar.gz
mv apache-maven-3.3.9 /opt
export PATH=/opt/apache-maven-3.3.9/bin:$PATH
echo 'export PATH=/opt/apache-maven-3.3.9/bin:$PATH' > /etc/profile.d/maven.sh
chmod +x /etc/profile.d/maven.sh
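
After opening a new shell, it can be useful to confirm that the Maven directory really is on PATH before starting the build. The helper below is an illustrative sketch (path_contains is a made-up name); running mvn -version afterwards shows which Maven and Java versions are picked up.

```shell
# path_contains DIR [PATHLIST]: succeeds when DIR is an entry of the
# colon-separated PATHLIST (defaults to $PATH).
path_contains() {
    case ":${2-$PATH}:" in
        *":$1:"*) return 0 ;;
        *) return 1 ;;
    esac
}

if path_contains /opt/apache-maven-3.3.9/bin; then
    echo "Maven directory is on PATH"
else
    echo "Maven directory is NOT on PATH"
fi
```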


On the Ambari node, install and start Docker (it is needed to build the Metron mpack for Ambari):

 

yum install docker-io -y
service docker start

 

Also on your build box, install npm (needed to build metron-config, part of the UI):

yum install npm -y

 

Build Metron code:

On main node clone Metron repository:

 

git clone https://github.com/apache/incubator-metron

 

If you install Metron on a single node (rather than on multiple nodes, as advised), you need to modify the ElasticSearch config templates so that they contain only the configuration shown below. The config templates are located in incubator-metron/metron-deployment/packaging/ambari/metron-mpack/src/main/resources/common-services/ELASTICSEARCH/2.3.3/package/templates/

cluster.name: metron
network.host: ["_eth0:ipv4_","_local:ipv4_"]
discovery.zen.ping.unicast.hosts: [ <single_node_hostname> ]
path.data: /opt/lmm/es_data
index.number_of_replicas: 0
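
As a convenience, the single-node settings can be generated with the hostname already filled in. This is a sketch only: render_es_config is a hypothetical helper, node1 stands in for your hostname, and the output still has to be placed into the template files by hand.

```shell
# render_es_config HOSTNAME: print the single-node settings listed above with
# the hostname placeholder filled in.
render_es_config() {
    cat <<EOF
cluster.name: metron
network.host: ["_eth0:ipv4_","_local:ipv4_"]
discovery.zen.ping.unicast.hosts: [ $1 ]
path.data: /opt/lmm/es_data
index.number_of_replicas: 0
EOF
}

render_es_config node1
```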

Fix the Kibana install file (this should be fixed by METRON-641):

sed -i 's@{}/kibana@{0}/kibana@g' incubator-metron/metron-deployment/packaging/ambari/metron-mpack/src/main/resources/common-services/KIBANA/4.5.1/package/scripts/kibana_master.py
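
The substitution gives the format field an explicit index. The likely reason, which is an assumption and not stated in the source, is that CentOS 6 ships Python 2.6, whose str.format() does not accept auto-numbered "{}" fields. The sketch below runs the same sed expression on a made-up sample line so the effect can be seen without touching the repository.

```shell
# number_first_field: rewrite "{}/kibana" to "{0}/kibana" on stdin, using the
# same sed expression as the command above.
number_first_field() {
    sed 's@{}/kibana@{0}/kibana@g'
}

# Hypothetical line resembling the affected code, for illustration only.
printf '%s\n' "cmd = 'mkdir -p {}/kibana'.format(path)" | number_first_field
```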

Build Metron with HDP 2.5 profile:

 

cd incubator-metron
mvn clean install -DskipTests -PHDP-2.5.0.0
cd metron-deployment/packaging/docker/rpm-docker
mvn clean install -DskipTests -PHDP-2.5.0.0

 

On all nodes, create the localrepo directory. On the Ambari node, copy the built RPMs into it:

 

mkdir /localrepo
cp -rp /root/incubator-metron/metron-deployment/packaging/docker/rpm-docker/RPMS/noarch/* /localrepo/

 

Use scp for remote nodes:

 

scp /localrepo/* <replace_with_node_ip>:/localrepo/
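
With more than a couple of nodes, the copy step is easier to run as a loop. The sketch below only prints the commands so they can be reviewed first; copy_commands is a hypothetical helper and node2/node3 are example hostnames. Unless passwordless SSH is already in place, each command will prompt for a password.

```shell
# copy_commands NODE...: print the commands that create /localrepo on each
# node and copy the RPMs there. Pipe the output to sh to execute it.
copy_commands() {
    for node in "$@"; do
        printf 'ssh %s mkdir -p /localrepo\n' "$node"
        printf 'scp /localrepo/* %s:/localrepo/\n' "$node"
    done
}

copy_commands node2 node3
```

Once the printed commands look right, run them with: copy_commands node2 node3 | sh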

 

If passwordless SSH has not yet been set up within the cluster, generate a key on the main node:

 

cat /dev/zero | ssh-keygen -q -N "" 2>/dev/null

 

Add this key to all the slave nodes:

 

ssh-copy-id -i ~/.ssh/id_rsa.pub <replace_with_node_ip>
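
Before handing the key to Ambari, it helps to confirm that key-based login works on every node. The following is a sketch with made-up helper names (can_login, report_logins); it relies on the standard ssh options BatchMode and ConnectTimeout, and SSH_CMD exists only so the client can be swapped out for testing.

```shell
# SSH_CMD can be overridden (e.g. for testing); defaults to the real client.
SSH_CMD=${SSH_CMD:-ssh}

# can_login NODE: succeeds if a non-interactive login to NODE works.
# BatchMode=yes makes ssh fail instead of prompting for a password.
can_login() {
    $SSH_CMD -o BatchMode=yes -o ConnectTimeout=5 "$1" true
}

# report_logins NODE...: print one status line per node.
report_logins() {
    for node in "$@"; do
        if can_login "$node" 2>/dev/null; then
            echo "$node: ok"
        else
            echo "$node: FAILED"
        fi
    done
}

# Example:
# report_logins node2 node3
```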

 

Ambari 2.4 with HDP 2.5 install

Inspired by: http://docs.hortonworks.com/HDPDocuments/Ambari-2.4.1.0/bk_ambari-installation/content/ch_Getting_Ready.html

Raise the open-file and process limits to a safe level:

ulimit -n 32768
ulimit -u 65536
echo -e "* - nofile 32768\n* - nproc 65536" >> /etc/security/limits.conf
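
Because the echo above appends with >>, running it a second time adds duplicate lines to limits.conf. If the steps might be repeated, an idempotent append avoids that. The helper below is a sketch (append_once is a made-up name) and is demonstrated on a scratch file; on a real node the target would be /etc/security/limits.conf.

```shell
# append_once LINE FILE: append LINE to FILE only if FILE does not already
# contain an identical line.
append_once() {
    grep -qxF -e "$1" "$2" 2>/dev/null || printf '%s\n' "$1" >> "$2"
}

# Demonstration on a scratch file.
demo=$(mktemp)
append_once "* - nofile 32768" "$demo"
append_once "* - nproc 65536" "$demo"
append_once "* - nofile 32768" "$demo"   # no effect: line already present
cat "$demo"
rm -f "$demo"
```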


Enable time sync, disable the firewall and SELinux:

 

chkconfig ntpd on
service ntpd start
chkconfig iptables off
/etc/init.d/iptables stop
setenforce 0
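
Note that setenforce 0 only switches SELinux to permissive mode until the next reboot. To make the change persistent, the SELINUX= line in /etc/selinux/config has to be changed as well. The sketch below shows one way to do that; set_selinux_mode is a hypothetical helper that takes the file as an argument, so it can be tried on a copy first.

```shell
# set_selinux_mode MODE FILE: rewrite the SELINUX= line of FILE to MODE
# (enforcing, permissive or disabled). FILE is normally /etc/selinux/config.
# The SELINUXTYPE= line is left untouched.
set_selinux_mode() {
    sed -i "s/^SELINUX=.*/SELINUX=$1/" "$2"
}

# Example (as root):
# set_selinux_mode permissive /etc/selinux/config
```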

 

Make sure that each node can resolve every other node's hostname, or add the hostname of each node to /etc/hosts on every node. For example, add the following lines to /etc/hosts on each node:

10.10.10.1 node1
10.10.10.2 node2
10.10.10.3 node3

Where 10.10.10.1, 10.10.10.2 and 10.10.10.3 are the IPs of your nodes and node1, node2 and node3 are hostnames.
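
To verify that no node was forgotten, the hosts file can be checked against the list of expected hostnames. This is a sketch under the assumption that name resolution is done through /etc/hosts only; missing_hosts is a made-up helper and it ignores DNS.

```shell
# missing_hosts FILE NAME...: print each NAME that has no entry in FILE
# (a hosts-format file: IP address followed by one or more names per line).
missing_hosts() {
    file=$1
    shift
    for name in "$@"; do
        awk -v n="$name" '
            /^[[:space:]]*#/ { next }
            { for (i = 2; i <= NF; i++) if ($i == n) found = 1 }
            END { exit found ? 0 : 1 }
        ' "$file" || echo "$name"
    done
}

# Example (no output means every name has an entry):
# missing_hosts /etc/hosts node1 node2 node3
```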

On the main node, download and set up the Ambari repo (you may replace the "2.4.3.0" in the URL with a newer Ambari version number):

 

wget -nv http://public-repo-1.hortonworks.com/ambari/centos6/2.x/updates/2.4.3.0/ambari.repo -O /etc/yum.repos.d/ambari.repo

 

Check that it was added (the sample output below is from version 2.4.1.0; yours will show the version you downloaded):

 

# yum repolist | grep ambari
Updates-ambari-2.4.1.0 ambari-2.4.1.0 - Updates

 

Install and setup Ambari server:

 

yum install ambari-server -y
ambari-server setup -s

 

Add Metron service to Ambari by running mpack command (make sure to specify correct path to mpack in --mpack=):

 

ambari-server install-mpack --mpack=incubator-metron/metron-deployment/packaging/ambari/metron-mpack/target/metron_mpack-1.0.0.0-SNAPSHOT.tar.gz --verbose

 

Start Ambari:

 

ambari-server start

 

Access the Ambari UI by going to the following URL in a web browser (use admin / admin as user / pass):

 

http://<replace_with_master_node_ip>:8080/#/installer/step0
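
The Ambari server can take a little while to answer on port 8080 after ambari-server start. If you script the install, a small retry loop can wait for it. The retry function below is a generic sketch with a made-up name; the curl flags used in the example (-s, -f, -o) are standard, and the attempt count and delay are arbitrary choices.

```shell
# retry ATTEMPTS DELAY COMMAND...: run COMMAND until it succeeds, at most
# ATTEMPTS times, sleeping DELAY seconds between tries.
retry() {
    attempts=$1
    delay=$2
    shift 2
    n=1
    until "$@"; do
        if [ "$n" -ge "$attempts" ]; then
            return 1
        fi
        n=$((n + 1))
        sleep "$delay"
    done
}

# Example: poll the Ambari web port 30 times, 5 seconds apart.
# retry 30 5 curl -s -f -o /dev/null http://<replace_with_master_node_ip>:8080/
```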

 

Get Started page: Enter any desired cluster name.

Select Version: Make sure "Public Repository" is checked.

Install Options: In Target Hosts, list the hostnames of the nodes on which the Ambari cluster should be installed (all the ones you specified in /etc/hosts). Paste the contents of the main node's private key (/root/.ssh/id_rsa) into "Host Registration Information". If you receive a warning like the one below, ignore it and click OK:

The following hostnames are not valid FQDNs

Choose Services: Select following Services:

 

Metron related:

  • HDFS
  • YARN + MapReduce2
  • HBase
  • Zookeeper
  • Storm
  • Flume
  • Kafka
  • Elasticsearch
  • Kibana
  • Metron
  • Ambari Metrics

Dependencies:

  • Pig
  • Tez
  • Slider

 

Assign Masters: Assign "Kafka Broker" to all nodes. Make sure to place the following components together on one common node:

 

 

  • Storm UI Server
  • Metron Indexing
  • MySQL Server
  • Kibana Server
  • Elasticsearch Master
  • Metron Parsers
  • Metron Enrichment

Assign Slaves and Clients: Select "All" for:

  • DataNode
  • NodeManager
  • RegionServer
  • Supervisor
  • Client

 

Customize Services: The following services need to be configured:

 

ElasticSearch

Set zen_discovery_ping_unicast_hosts to: <replace_with_elasticsearch_master_hostname> (the hostname or IP of the node to which you assigned Elasticsearch Master in the Assign Masters step)

Kibana

Set kibana_es_url to: http://<replace_with_elasticsearch_master_hostname>:9200 (the hostname or IP of the node to which you assigned Elasticsearch Master in the Assign Masters step)

Metron

Set Elasticsearch Hosts to: <replace_with_elasticsearch_master_hostname> (the hostname or IP of the node to which you assigned Elasticsearch Master in the Assign Masters step)

Unless it is already fixed (see METRON-642), change the global.json template from:

"es.ip": "{{ es_url }}",

to:

"es.ip": "<replace_with_elasticsearch_master_hostname>",

"es.port": "9300", 

 

Set the rest of the configuration values to those recommended by Ambari or to the ones you prefer (e.g. DB passwords), then perform the install.

Post-install configuration

Unless it is already fixed (see METRON-642), fix the ElasticSearch directory ownership; otherwise ElasticSearch will crash right after it is started in Ambari:

chown -Rh elasticsearch:elasticsearch /etc/elasticsearch

By default the directory is owned by root, which causes the following error:

Likely root cause: java.nio.file.AccessDeniedException: /etc/elasticsearch/scripts

 # ls -la /etc/elasticsearch
...
-rwxr-x---.   1 root          elasticsearch  2571 May 12 09:24 logging.yml
drwxr-x---.   2 root          elasticsearch  4096 May 17 11:49 scripts

Fix the path to the ES log file in the Java parameters (unless it is already fixed as reported in METRON-642):

sed -i 's@elasticsearchelasticsearch@elasticsearch/elasticsearch@g' /etc/sysconfig/elasticsearch
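
After the chown, it is easy to confirm that nothing under /etc/elasticsearch is still owned by another user. The helper below is a sketch (not_owned_by is a made-up name) built on the standard find test -user, which accepts a user name or a numeric ID.

```shell
# not_owned_by USER DIR: list everything under DIR whose owner is not USER
# (USER may be a user name or a numeric user ID).
not_owned_by() {
    find "$2" ! -user "$1"
}

# Example (no output means the ownership fix was applied everywhere):
# not_owned_by elasticsearch /etc/elasticsearch
```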

It is OK if some services fail to start. Check the errors and start those services manually.


TROUBLESHOOTING

If you built Metron with the HDP-2.5.0.0 Maven profile, ignore the Storm UI error shown in the screenshot below:

It appears because your Kafka topic was not created or contains no data. Set up streaming and make sure that the Kafka topic from which the topology reads data exists.

 

If you have a GUI installed on your server, run the following before running the git clone command:

 

unset SSH_ASKPASS

Or you may receive an error:

(gnome-ssh-askpass:15028): Gtk-WARNING **: cannot open display:

 

If Ambari Metrics does not come up, run this:

 

cd /usr/lib/python2.6/site-packages/resource_monitoring/
python psutil/build.py

 

And re-try.

If you receive an error:

Unsupported major.minor version 52.0

You may need to install Java 1.8 on the node you receive this from:

 

yum install java-1.8.0-openjdk java-1.8.0-openjdk-devel -y
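
The number in this message is the class-file format version. Version 52 corresponds to Java 8, so the error means that code compiled for Java 8 is being run on an older JVM. The mapping is simply the major version minus 44, as the small sketch below shows (the function name is made up for illustration).

```shell
# java_release_for_class_version MAJOR: map a class-file major version to the
# Java release that introduced it (50 -> 6, 51 -> 7, 52 -> 8).
java_release_for_class_version() {
    echo $(( $1 - 44 ))
}

java_release_for_class_version 52   # prints 8
```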

 

 

 
