上山打老虎 发表于 2021-7-3 21:40:21

Percona-mysql MHA高可用实战方案

  前言MHA(Master High Availability)目前在MySQL高可用方面是一个相对成熟的解决方案,它由日本DeNA公司youshimaton(现就职于Facebook公司)开发,是一套优秀的作为MySQLy高可用环境下故障切换和主从提升的高可用软件。在MySQL故障切换过程中,MHA能做到在0~30秒之内自动完成数据库的故障切换操作,并且在进行故障切换过程中,MHA能在最大程度上保证数据的一致性,以达到真正意义上的高可用。
      它由两部分组成:MHA Manager(管理节点)和MHA Node(数据节点)。MHA Manager可以单独部署在一台独立的机器上管理多个master-slave集群,也可以部署在一台slave上。
MHA node运行在每台MySQL服务器上,MHA Manager会定时探测集群中的master节点,当master 出现故障时,它可以自动将最新数据的slave提升为新的master,然后将所有其他的slave重新指向新的master。整个故障转移过程对应用程序是完全透明的。
1.   安装部署MHA前准备  MHA架构图

  
   具体搭建如表:
角色
IP地址
主机名
serverID
类型
Monitor host
192.168.127.100
MHA
监控集群组
Master
192.168.127.101
master
101
写入
Candicate master
192.168.127.102
slave01
102

slave
192.168.127.103
slave02
103

  
  vi /etc/hosts
  127.0.0.1   localhost localhost.localdomain localhost4 localhost4.localdomain4
  ::1         localhost localhost.localdomain localhost6 localhost6.localdomain6
  192.168.127.100   MHA
  192.168.127.101   master
  192.168.127.102   slave01
  192.168.127.103   slave02

1.1.   percona-mysql安装(master、slave01、slave02 三台安装)
  注意:三台的server_id 不一样,为了做主从同步
  创建mysql用户:
  useradd mysql
  创建安装目录与数据目录:
  mkdir /app
  mkdir -p /data/mysql3306
  解决percona-mysql软件:
  tar zxvf Percona-Server-5.6.27-rel75.0-Linux.x86_64.ssl101.tar.gz
  注意:安装的软件需要根据openssl版本来下载
  rpm -qa | grep ssl
  openssl-1.0.1e-15.el6.x86_64
  把解压文件移动相应目录:
  mv Percona-Server-5.6.27-rel75.0-Linux.x86_64.ssl101 /app/mysql5.6
  
  创建放慢查询日志目录:
  mkdir /app/mysql5.6/logs
  给目录权限:
  chown -R mysql:mysql /app/mysql5.6
  chown -R mysql:mysql /data/mysql3306
  
  创建配置文件
  vi /app/mysql5.6/my.cnf
  
  
  socket=/app/mysql5.6/mysql.sock
  default-character-set=utf8
  port=3306
  
  prompt=\\u@\\d \\r:\\m:\\s>
  no-auto-rehash
  
  log-error=/data/mysql3306/mysqld.error
  
  socket=/app/mysql5.6/mysql.sock
  pid-file=/app/mysql5.6/mysqld.pid
  basedir=/app/mysql5.6
  datadir=/data/mysql3306
  port=3306
  server_id=101
  character-set-server=utf8
  skip-external-locking
  skip-name-resolve
  max_connections=1024
  max_connect_errors=1000
  wait_timeout =400
  interactive_timeout = 400
  table_definition_cache=500
  table_open_cache=500
  sort_buffer_size = 16M
  tmp_table_size = 200M
  
  read_buffer_size = 1M
  read_rnd_buffer_size = 4M
  myisam_sort_buffer_size = 64M
  thread_cache_size = 8
  query_cache_type=0
  query_cache_size=0
  thread_concurrency = 16
  lower_case_table_names = 1
  log_bin_trust_function_creators = 1
  #################slow log####################
  slow-query_log=1
  slow-query_log_file=/app/mysql5.6/logs/mysql.slow
  long_query_time=2
  ####################binlog######################
  log-bin=mysql-bin
  binlog-format=ROW
  expire_logs_days=5
  sync_binlog=1
  ################replication##########
  log-slave-updates=1
  ################INNODB################
  sql_mode=NO_ENGINE_SUBSTITUTION,STRICT_TRANS_TABLES
  transaction-isolation=READ-COMMITTED
  innodb_buffer_pool_size=10G
  innodb_flush_log_at_trx_commit=2
  innodb_strict_mode=1
  innodb_flush_method=O_DIRECT
  innodb_file_format=Barracuda
  innodb_log_files_in_group=3
  innodb_file_per_table=1
  innodb_io_capacity=500
  innodb_support_xa=1
  innodb_additional_mem_pool_size=16M
  innodb_log_buffer_size=64M
  
  
  
  quick
  max_allowed_packet=128M
  myisam_max_sort_sort_file_size=2G
  
  
  初始化数据库
  /app/mysql5.6/scripts/mysql_install_db --user=mysql --basedir=/app/mysql5.6 --datadir=/data/mysql3306   --defaults-file=/app/mysql5.6/my.cnf
  
  启动脚本
  cp/app/mysql5.6/support-files/mysql.server /etc/init.d/mysql
  vi /etc/init.d/mysql
  
  basedir=/app/mysql5.6
  datadir=/data/mysql3306
  
  注意:修改以上两处即可
  
  启动数据库
  
  /etc/init.d/mysql start
  
  Starting MySQL (Percona Server)....                        
  
  环境变量配置
  vi /etc/profile
  
  export MYSQL_HOME=/app/mysql5.6
  export MY_BASEDIR_VERSION=/app/mysql5.6
  export PATH=/app/mysql5.6/bin:/app/mysql5.6/scripts:$PATH
  export LD_LIBRARY_PATH=/app/mysql5.6/lib
  
  生效环境变量
  source /etc/profile

1.2   .主从同步搭建
  注意:防火墙需要关闭
  创建复制账号(master、slave1(mha新主))
  GRANT REPLICATIONSLAVE ON *.*TO 'repl'@'192.168.127.%'IDENTIFIED BY 'repl';
  flush privileges;
  
  查看master binlogPOS点信息
  root@(none) 06:47:05>show master status;
  +------------------+----------+--------------+------------------+-------------------+
  | File             | Position | Binlog_Do_DB | Binlog_Ignore_DB | Executed_Gtid_Set |
  +------------------+----------+--------------+------------------+-------------------+
  | mysql-bin.000004 |      409 |            |                  |                   |
  +------------------+----------+--------------+------------------+-------------------+
  1 row in set (0.01 sec)
  
  建立主从复制(slave01、slave02)
  # mysql
  Welcome to the MySQL monitor.Commands end with ; or \g.
  Your MySQL connection id is 2
  Server version: 5.6.27-75.0-log Percona Server (GPL), Release 75.0, Revision 8bb53b6
  
  Copyright (c) 2009-2015 Percona LLC and/or its affiliates
  Copyright (c) 2000, 2015, Oracle and/or its affiliates. All rights reserved.
  
  Oracle is a registered trademark of Oracle Corporation and/or its
  affiliates. Other names may be trademarks of their respective
  owners.
  
  Type 'help;' or '\h' for help. Type '\c' to clear the current input statement.
  
  root@(none) 07:03:39>CHANGE MASTER TO MASTER_HOST='192.168.127.101',MASTER_PORT=3306,MASTER_USER='repl',MASTER_PASSWORD='repl',MASTER_LOG_FILE='mysql-bin.000004',MASTER_LOG_POS=409;
  Query OK, 0 rows affected, 2 warnings (0.05 sec)
  
  root@(none) 07:03:41>start slave;
  Query OK, 0 rows affected (0.02 sec)
  
  查看主从复制
  root@(none) 07:03:42>show slave status\G;
  *************************** 1. row ***************************
                 Slave_IO_State: Waiting for master to send event
                  Master_Host: 192.168.127.101
                    Master_User: repl
                    Master_Port: 3306
                  Connect_Retry: 60
              Master_Log_File: mysql-bin.000004
            Read_Master_Log_Pos: 409
                 Relay_Log_File: mysqld-relay-bin.000002
                  Relay_Log_Pos: 283
        Relay_Master_Log_File: mysql-bin.000004
               Slave_IO_Running: Yes
              Slave_SQL_Running: Yes
              Replicate_Do_DB:
            Replicate_Ignore_DB:
           Replicate_Do_Table:
         Replicate_Ignore_Table:
        Replicate_Wild_Do_Table:
  Replicate_Wild_Ignore_Table:
                     Last_Errno: 0
                     Last_Error:
                 Skip_Counter: 0
            Exec_Master_Log_Pos: 409
              Relay_Log_Space: 457
              Until_Condition: None
                 Until_Log_File:
                  Until_Log_Pos: 0
           Master_SSL_Allowed: No
           Master_SSL_CA_File:
           Master_SSL_CA_Path:
              Master_SSL_Cert:
              Master_SSL_Cipher:
                 Master_SSL_Key:
        Seconds_Behind_Master: 0
  Master_SSL_Verify_Server_Cert: No
                  Last_IO_Errno: 0
                  Last_IO_Error:
                 Last_SQL_Errno: 0
                 Last_SQL_Error:
  Replicate_Ignore_Server_Ids:
               Master_Server_Id: 101
                    Master_UUID: 8b1cf62d-e063-11e5-84ba-000c2908253f
               Master_Info_File: /data/mysql3306/master.info
                    SQL_Delay: 0
            SQL_Remaining_Delay: NULL
        Slave_SQL_Running_State: Slave has read all relay log; waiting for the slave I/O thread to update it
           Master_Retry_Count: 86400
                    Master_Bind:
        Last_IO_Error_Timestamp:
     Last_SQL_Error_Timestamp:
                 Master_SSL_Crl:
           Master_SSL_Crlpath:
           Retrieved_Gtid_Set:
              Executed_Gtid_Set:
                  Auto_Position: 0
  1 row in set (0.02 sec)
  
  ERROR:
  No query specified
  
  以上主从已经搭建好,下面我们安装与配置MHA
  (1)slave服务器(192.168.127.102,192.168.103)设置read only;
  mysql> set global read_only=1;
  (2)设置relay log清除方式(在每个slave 下)
  mysql> set global relay_log_purge=0;
  (3)创建监控用户,在所有MYSQL上执行
  mysql> grant all privileges on *.* to 'root'@'192.168.127.%' identified by '123456';
  mysql>flush privileges;
  
  
  (4)在slave01(192.168.127.102)上创建复制用户:
  mysql> grant replication slave on *.* to 'repl'@'192.168.127.%' identified by 'repl';
  mysql>flush privileges;
  
  
2.安装部署MHA
2.1安装MHA node(在所有Mysql服务器上安装)
  (1)安装依赖包
  rpm -Uvh http://dl.fedoraproject.org/pub/epel/6/x86_64/epel-release-6-8.noarch.rpm
  rpm --import /etc/pki/rpm-gpg/RPM-GPG-KEY-EPEL-6
  yum -y install perl-DBD-MySQL perl-Config-Tiny perl-Log-Dispatch perl-Parallel-ForkManager perl-Config-IniFiles perl-Time-HiResperl-Time-HiResperl-CPAN
  
  (2)在所有的节点上安装mha node:
  tarzxvfmha4mysql-node-0.56.tar.gz
  cdmha4mysql-node-0.56
  perlMakefile.PL
  make
  make install
  

2.2.安装MHA Manager
  MHA Manager中主要包括了几个管理员的命令行工具,例如masterha_manager,masterha_master_switch等。
  (1)       安装依赖包
rpm -Uvh http://dl.fedoraproject.org/pub/epel/6/x86_64/epel-release-6-8.noarch.rpm
rpm --import /etc/pki/rpm-gpg/RPM-GPG-KEY-EPEL-6
yum -y install perl-DBD-MySQL perl-Config-Tiny perl-Log-Dispatch perl-Parallel-ForkManager perl-Config-IniFiles perl-Time-HiResperl-Time-HiResperl-CPAN

  (2)       安装MHA node软件包。注意,在MHA Manger的主机上也要安装MHA node.
tarzxvfmha4mysql-node-0.56.tar.gz
cdmha4mysql-node-0.56
perlMakefile.PL
make
make install

  (3)       安装MHA Manager软件包。
tar zxvf mha4mysql-manager-0.56.tar.gz
cd mha4mysql-manager-0.56
perl Makefile.PL
make
make install
  

2.3. 配置SSH 登录无密码验证
  (1)       在manager (192.168.127.100)上配置到所有节点的无密码验证
  ssh-keygen -t rsa
ssh-copy-id -i ~/.ssh/id_rsa.pub root@MHA
ssh-copy-id -i ~/.ssh/id_rsa.pub root@master
  ssh-copy-id -i ~/.ssh/id_rsa.pub root@slave01
ssh-copy-id -i ~/.ssh/id_rsa.pub root@slave02
  (2)       在MHA Node master(192.168.127.101)上:
   ssh-keygen -t rsa
   ssh-copy-id -i ~/.ssh/id_rsa.pub root@MHA
ssh-copy-id -i ~/.ssh/id_rsa.pub root@master
ssh-copy-id -i ~/.ssh/id_rsa.pub root@slave01
   ssh-copy-id -i ~/.ssh/id_rsa.pub root@slave02
  (3)       在MHA Node slave01(192.168.127.102)上:
   ssh-keygen -t rsa
ssh-copy-id -i ~/.ssh/id_rsa.pub root@MHA
   ssh-copy-id -i ~/.ssh/id_rsa.pub root@master
ssh-copy-id -i ~/.ssh/id_rsa.pub root@slave01
   ssh-copy-id -i ~/.ssh/id_rsa.pub root@slave02
  (4)       在MHA Node slave02(192.168.127.103)上:
   ssh-keygen -t rsa
ssh-copy-id -i ~/.ssh/id_rsa.pub root@MHA
   ssh-copy-id -i ~/.ssh/id_rsa.pub root@master
   ssh-copy-id -i ~/.ssh/id_rsa.pub root@slave01
ssh-copy-id -i ~/.ssh/id_rsa.pub root@slave02
  
     在每台做以下步骤
  ln -s /app/mysql5.6/bin/* /usr/local/bin/
  把脚本拷贝相关目录
  # cp /root/mha4mysql-manager-0.56/samples/scripts/master_ip_failover /usr/local/bin/
  
  # cp /root/mha4mysql-manager-0.56/samples/scripts/master_ip_online_change /usr/local/bin/
  
  #cp /root/mha4mysql-manager-0.56/samples/scripts/send_report/usr/local/bin/
  
  # cp /root/mha4mysql-manager-0.56/bin/masterha_secondary_check /usr/bin/
3.配置MHA  配置MHA的步骤如下。
  (1)       创建MHA工作目录,并且创建相关配置文件:
mkdir -p /etc/masterha
mkdir -p /masterha/app1
        配置如下
vi /etc/masterha/app1.cnf

manager_workdir=/masterha/app1
manager_log=/masterha/app1/app1.log
master_ip_failover_script=/usr/local/bin/master_ip_failover
master_ip_online_change_script=/usr/local/bin/master_ip_online_change

user=root
password=123456
ssh_user=root
repl_user=repl
repl_password=repl
ping_interval=1
remote_workdir=/tmp
report_script=/usr/local/bin/send_report
secondary_check_script=/usr/bin/masterha_secondary_check-s MHA -s slave02--user=root --master_host=master --master_ip=192.168.127.101 --master_port=3306 --password=123456
shutdown_script=""
report_script=""



hostname=192.168.127.101
master_binlog_dir=/data/mysql3306
candidate_master=1

hostname=192.168.127.102
master_binlog_dir=/data/mysql3306
candidate_master=1
check_repl_delay=0


hostname=192.168.127.103
master_binlog_dir=/data/mysql3306
no_master=1

4.检查SSH的配置检查MHA Manager到所有MHA node的SSH连接状态:
  # masterha_check_ssh --conf=/etc/masterha/app1.cnf
  Wed Mar2 19:03:30 2016 - Global configuration file /etc/masterha_default.cnf not found. Skipping.
  Wed Mar2 19:03:30 2016 - Reading application default configuration from /etc/masterha/app1.cnf..
  Wed Mar2 19:03:30 2016 - Reading server configuration from /etc/masterha/app1.cnf..
  Wed Mar2 19:03:30 2016 - Starting SSH connection tests..
  Wed Mar2 19:03:31 2016 -
  Wed Mar2 19:03:30 2016 - Connecting via SSH from root@192.168.127.101(192.168.127.101:22) to root@192.168.127.102(192.168.127.102:22)..
  Wed Mar2 19:03:30 2016 -    ok.
  Wed Mar2 19:03:30 2016 - Connecting via SSH from root@192.168.127.101(192.168.127.101:22) to root@192.168.127.103(192.168.127.103:22)..
  Wed Mar2 19:03:30 2016 -    ok.
  Wed Mar2 19:03:31 2016 -
  Wed Mar2 19:03:30 2016 - Connecting via SSH from root@192.168.127.102(192.168.127.102:22) to root@192.168.127.101(192.168.127.101:22)..
  Wed Mar2 19:03:31 2016 -    ok.
  Wed Mar2 19:03:31 2016 - Connecting via SSH from root@192.168.127.102(192.168.127.102:22) to root@192.168.127.103(192.168.127.103:22)..
  Wed Mar2 19:03:31 2016 -    ok.
  Wed Mar2 19:03:32 2016 -
  Wed Mar2 19:03:31 2016 - Connecting via SSH from root@192.168.127.103(192.168.127.103:22) to root@192.168.127.101(192.168.127.101:22)..
  Wed Mar2 19:03:31 2016 -    ok.
  Wed Mar2 19:03:31 2016 - Connecting via SSH from root@192.168.127.103(192.168.127.103:22) to root@192.168.127.102(192.168.127.102:22)..
  Wed Mar2 19:03:32 2016 -    ok.
Wed Mar2 19:03:32 2016 - All SSH connection tests passed successfully.
5.检查整个复制环境  # masterha_check_repl --conf=/etc/masterha/app1.cnf
  Wed Mar2 19:03:30 2016 - Global configuration file /etc/masterha_default.cnf not found. Skipping.
  Wed Mar2 19:03:30 2016 - Reading application default configuration from /etc/masterha/app1.cnf..
  Wed Mar2 19:03:30 2016 - Reading server configuration from /etc/masterha/app1.cnf..
  Wed Mar2 19:03:30 2016 - Starting SSH connection tests..
  Wed Mar2 19:03:31 2016 -
  Wed Mar2 19:03:30 2016 - Connecting via SSH from root@192.168.127.101(192.168.127.101:22) to root@192.168.127.102(192.168.127.102:22)..
  Wed Mar2 19:03:30 2016 -    ok.
  Wed Mar2 19:03:30 2016 - Connecting via SSH from root@192.168.127.101(192.168.127.101:22) to root@192.168.127.103(192.168.127.103:22)..
  Wed Mar2 19:03:30 2016 -    ok.
  Wed Mar2 19:03:31 2016 -
  Wed Mar2 19:03:30 2016 - Connecting via SSH from root@192.168.127.102(192.168.127.102:22) to root@192.168.127.101(192.168.127.101:22)..
  Wed Mar2 19:03:31 2016 -    ok.
  Wed Mar2 19:03:31 2016 - Connecting via SSH from root@192.168.127.102(192.168.127.102:22) to root@192.168.127.103(192.168.127.103:22)..
  Wed Mar2 19:03:31 2016 -    ok.
  Wed Mar2 19:03:32 2016 -
  Wed Mar2 19:03:31 2016 - Connecting via SSH from root@192.168.127.103(192.168.127.103:22) to root@192.168.127.101(192.168.127.101:22)..
  Wed Mar2 19:03:31 2016 -    ok.
  Wed Mar2 19:03:31 2016 - Connecting via SSH from root@192.168.127.103(192.168.127.103:22) to root@192.168.127.102(192.168.127.102:22)..
  Wed Mar2 19:03:32 2016 -    ok.
  Wed Mar2 19:03:32 2016 - All SSH connection tests passed successfully.
  # masterha_check_repl --conf=/etc/masterha/app1.cnf
  Wed Mar2 19:04:12 2016 - Global configuration file /etc/masterha_default.cnf not found. Skipping.
  Wed Mar2 19:04:12 2016 - Reading application default configuration from /etc/masterha/app1.cnf..
  Wed Mar2 19:04:12 2016 - Reading server configuration from /etc/masterha/app1.cnf..
  Wed Mar2 19:04:12 2016 - MHA::MasterMonitor version 0.56.
  Wed Mar2 19:04:12 2016 - GTID failover mode = 0
  Wed Mar2 19:04:12 2016 - Dead Servers:
  Wed Mar2 19:04:12 2016 - Alive Servers:
  Wed Mar2 19:04:12 2016 -    192.168.127.101(192.168.127.101:3306)
  Wed Mar2 19:04:12 2016 -    192.168.127.102(192.168.127.102:3306)
  Wed Mar2 19:04:12 2016 -    192.168.127.103(192.168.127.103:3306)
  Wed Mar2 19:04:12 2016 - Alive Slaves:
  Wed Mar2 19:04:12 2016 -    192.168.127.102(192.168.127.102:3306)Version=5.6.27-75.0-log (oldest major version between slaves) log-bin:enabled
  Wed Mar2 19:04:12 2016 -    Replicating from 192.168.127.101(192.168.127.101:3306)
  Wed Mar2 19:04:12 2016 -    Primary candidate for the new Master (candidate_master is set)
  Wed Mar2 19:04:12 2016 -    192.168.127.103(192.168.127.103:3306)Version=5.6.27-75.0-log (oldest major version between slaves) log-bin:enabled
  Wed Mar2 19:04:12 2016 -    Replicating from 192.168.127.101(192.168.127.101:3306)
  Wed Mar2 19:04:12 2016 -    Not candidate for the new Master (no_master is set)
  Wed Mar2 19:04:12 2016 - Current Alive Master: 192.168.127.101(192.168.127.101:3306)
  Wed Mar2 19:04:12 2016 - Checking slave configurations..
  Wed Mar2 19:04:12 2016 - Checking replication filtering settings..
  Wed Mar2 19:04:12 2016 - binlog_do_db= , binlog_ignore_db=
  Wed Mar2 19:04:12 2016 - Replication filtering check ok.
  Wed Mar2 19:04:12 2016 - GTID (with auto-pos) is not supported
  Wed Mar2 19:04:12 2016 - Starting SSH connection tests..
  Wed Mar2 19:04:14 2016 - All SSH connection tests passed successfully.
  Wed Mar2 19:04:14 2016 - Checking MHA Node version..
  Wed Mar2 19:04:15 2016 - Version check ok.
  Wed Mar2 19:04:15 2016 - Checking SSH publickey authentication settings on the current master..
  Wed Mar2 19:04:15 2016 - HealthCheck: SSH to 192.168.127.101 is reachable.
  Wed Mar2 19:04:15 2016 - Master MHA Node version is 0.56.
  Wed Mar2 19:04:15 2016 - Checking recovery script configurations on 192.168.127.101(192.168.127.101:3306)..
  Wed Mar2 19:04:15 2016 -    Executing command: save_binary_logs --command=test --start_pos=4 --binlog_dir=/data/mysql3306 --output_file=/tmp/save_binary_logs_test --manager_version=0.56 --start_file=mysql-bin.000004
  Wed Mar2 19:04:15 2016 -    Connecting to root@192.168.127.101(192.168.127.101:22)..
  Creating /tmp if not exists..    ok.
  Checking output directory is accessible or not..
     ok.
  Binlog found at /data/mysql3306, up to mysql-bin.000004
  Wed Mar2 19:04:15 2016 - Binlog setting check done.
  Wed Mar2 19:04:15 2016 - Checking SSH publickey authentication and checking recovery script configurations on all alive slave servers..
  Wed Mar2 19:04:15 2016 -    Executing command : apply_diff_relay_logs --command=test --slave_user='root' --slave_host=192.168.127.102 --slave_ip=192.168.127.102 --slave_port=3306 --workdir=/tmp --target_version=5.6.27-75.0-log --manager_version=0.56 --relay_log_info=/data/mysql3306/relay-log.info--relay_dir=/data/mysql3306/--slave_pass=xxx
  Wed Mar2 19:04:15 2016 -    Connecting to root@192.168.127.102(192.168.127.102:22)..
  Checking slave recovery environment settings..
      Opening /data/mysql3306/relay-log.info ... ok.
      Relay log found at /data/mysql3306, up to mysqld-relay-bin.000002
      Temporary relay log file is /data/mysql3306/mysqld-relay-bin.000002
      Testing mysql connection and privileges..Warning: Using a password on the command line interface can be insecure.
   done.
      Testing mysqlbinlog output.. done.
      Cleaning up test file(s).. done.
  Wed Mar2 19:04:16 2016 -    Executing command : apply_diff_relay_logs --command=test --slave_user='root' --slave_host=192.168.127.103 --slave_ip=192.168.127.103 --slave_port=3306 --workdir=/tmp --target_version=5.6.27-75.0-log --manager_version=0.56 --relay_log_info=/data/mysql3306/relay-log.info--relay_dir=/data/mysql3306/--slave_pass=xxx
  Wed Mar2 19:04:16 2016 -    Connecting to root@192.168.127.103(192.168.127.103:22)..
  Checking slave recovery environment settings..
      Opening /data/mysql3306/relay-log.info ... ok.
      Relay log found at /data/mysql3306, up to mysqld-relay-bin.000002
      Temporary relay log file is /data/mysql3306/mysqld-relay-bin.000002
      Testing mysql connection and privileges..Warning: Using a password on the command line interface can be insecure.
   done.
      Testing mysqlbinlog output.. done.
      Cleaning up test file(s).. done.
  Wed Mar2 19:04:16 2016 - Slaves settings check done.
  Wed Mar2 19:04:16 2016 -

[*]
   +--192.168.127.102(192.168.127.102:3306)
   +--192.168.127.103(192.168.127.103:3306)
  Wed Mar2 19:04:16 2016 - Checking replication health on 192.168.127.102..
  Wed Mar2 19:04:16 2016 - ok.
  Wed Mar2 19:04:16 2016 - Checking replication health on 192.168.127.103..
  Wed Mar2 19:04:16 2016 - ok.
  Wed Mar2 19:04:16 2016 - Checking master_ip_failover_script status:
  Wed Mar2 19:04:16 2016 -    /usr/local/bin/master_ip_failover --command=status --ssh_user=root --orig_master_host=192.168.127.101 --orig_master_ip=192.168.127.101 --orig_master_port=3306
  Bareword "FIXME_xxx" not allowed while "strict subs" in use at /usr/local/bin/master_ip_failover line 93.
  Execution of /usr/local/bin/master_ip_failover aborted due to compilation errors.
  Wed Mar2 19:04:16 2016 - Failed to get master_ip_failover_script status with return code 255:0.
  Wed Mar2 19:04:16 2016 - Error happened on checking configurations.at /usr/local/bin/masterha_check_repl line 48
  Wed Mar2 19:04:16 2016 - Error happened on monitoring servers.
  Wed Mar2 19:04:16 2016 - Got exit code 1 (Not master dead).
MySQL Replication Health is NOT OK!
说明以上没有成功需要修改以上的问题
把93行#FIXME_xxx;注释掉


  # masterha_check_repl --conf=/etc/masterha/app1.cnf
  Wed Mar2 19:04:52 2016 - Global configuration file /etc/masterha_default.cnf not found. Skipping.
  Wed Mar2 19:04:52 2016 - Reading application default configuration from /etc/masterha/app1.cnf..
  Wed Mar2 19:04:52 2016 - Reading server configuration from /etc/masterha/app1.cnf..
  Wed Mar2 19:04:52 2016 - MHA::MasterMonitor version 0.56.
  Wed Mar2 19:04:52 2016 - GTID failover mode = 0
  Wed Mar2 19:04:52 2016 - Dead Servers:
  Wed Mar2 19:04:52 2016 - Alive Servers:
  Wed Mar2 19:04:52 2016 -    192.168.127.101(192.168.127.101:3306)
  Wed Mar2 19:04:52 2016 -    192.168.127.102(192.168.127.102:3306)
  Wed Mar2 19:04:52 2016 -    192.168.127.103(192.168.127.103:3306)
  Wed Mar2 19:04:52 2016 - Alive Slaves:
  Wed Mar2 19:04:52 2016 -    192.168.127.102(192.168.127.102:3306)Version=5.6.27-75.0-log (oldest major version between slaves) log-bin:enabled
  Wed Mar2 19:04:52 2016 -    Replicating from 192.168.127.101(192.168.127.101:3306)
  Wed Mar2 19:04:52 2016 -    Primary candidate for the new Master (candidate_master is set)
  Wed Mar2 19:04:52 2016 -    192.168.127.103(192.168.127.103:3306)Version=5.6.27-75.0-log (oldest major version between slaves) log-bin:enabled
  Wed Mar2 19:04:52 2016 -    Replicating from 192.168.127.101(192.168.127.101:3306)
  Wed Mar2 19:04:52 2016 -    Not candidate for the new Master (no_master is set)
  Wed Mar2 19:04:52 2016 - Current Alive Master: 192.168.127.101(192.168.127.101:3306)
  Wed Mar2 19:04:52 2016 - Checking slave configurations..
  Wed Mar2 19:04:52 2016 - Checking replication filtering settings..
  Wed Mar2 19:04:52 2016 - binlog_do_db= , binlog_ignore_db=
  Wed Mar2 19:04:52 2016 - Replication filtering check ok.
  Wed Mar2 19:04:52 2016 - GTID (with auto-pos) is not supported
  Wed Mar2 19:04:52 2016 - Starting SSH connection tests..
  Wed Mar2 19:04:54 2016 - All SSH connection tests passed successfully.
  Wed Mar2 19:04:54 2016 - Checking MHA Node version..
  Wed Mar2 19:04:54 2016 - Version check ok.
  Wed Mar2 19:04:54 2016 - Checking SSH publickey authentication settings on the current master..
  Wed Mar2 19:04:54 2016 - HealthCheck: SSH to 192.168.127.101 is reachable.
  Wed Mar2 19:04:55 2016 - Master MHA Node version is 0.56.
  Wed Mar2 19:04:55 2016 - Checking recovery script configurations on 192.168.127.101(192.168.127.101:3306)..
  Wed Mar2 19:04:55 2016 -    Executing command: save_binary_logs --command=test --start_pos=4 --binlog_dir=/data/mysql3306 --output_file=/tmp/save_binary_logs_test --manager_version=0.56 --start_file=mysql-bin.000004
  Wed Mar2 19:04:55 2016 -    Connecting to root@192.168.127.101(192.168.127.101:22)..
  Creating /tmp if not exists..    ok.
  Checking output directory is accessible or not..
     ok.
  Binlog found at /data/mysql3306, up to mysql-bin.000004
  Wed Mar2 19:04:55 2016 - Binlog setting check done.
  Wed Mar2 19:04:55 2016 - Checking SSH publickey authentication and checking recovery script configurations on all alive slave servers..
  Wed Mar2 19:04:55 2016 -    Executing command : apply_diff_relay_logs --command=test --slave_user='root' --slave_host=192.168.127.102 --slave_ip=192.168.127.102 --slave_port=3306 --workdir=/tmp --target_version=5.6.27-75.0-log --manager_version=0.56 --relay_log_info=/data/mysql3306/relay-log.info--relay_dir=/data/mysql3306/--slave_pass=xxx
  Wed Mar2 19:04:55 2016 -    Connecting to root@192.168.127.102(192.168.127.102:22)..
  Checking slave recovery environment settings..
      Opening /data/mysql3306/relay-log.info ... ok.
      Relay log found at /data/mysql3306, up to mysqld-relay-bin.000002
      Temporary relay log file is /data/mysql3306/mysqld-relay-bin.000002
      Testing mysql connection and privileges..Warning: Using a password on the command line interface can be insecure.
   done.
      Testing mysqlbinlog output.. done.
      Cleaning up test file(s).. done.
  Wed Mar2 19:04:55 2016 -    Executing command : apply_diff_relay_logs --command=test --slave_user='root' --slave_host=192.168.127.103 --slave_ip=192.168.127.103 --slave_port=3306 --workdir=/tmp --target_version=5.6.27-75.0-log --manager_version=0.56 --relay_log_info=/data/mysql3306/relay-log.info--relay_dir=/data/mysql3306/--slave_pass=xxx
  Wed Mar2 19:04:55 2016 -    Connecting to root@192.168.127.103(192.168.127.103:22)..
  Checking slave recovery environment settings..
      Opening /data/mysql3306/relay-log.info ... ok.
      Relay log found at /data/mysql3306, up to mysqld-relay-bin.000002
      Temporary relay log file is /data/mysql3306/mysqld-relay-bin.000002
      Testing mysql connection and privileges..Warning: Using a password on the command line interface can be insecure.
   done.
      Testing mysqlbinlog output.. done.
      Cleaning up test file(s).. done.
  Wed Mar2 19:04:55 2016 - Slaves settings check done.
  Wed Mar2 19:04:55 2016 -

[*]
   +--192.168.127.102(192.168.127.102:3306)
   +--192.168.127.103(192.168.127.103:3306)
  Wed Mar2 19:04:55 2016 - Checking replication health on 192.168.127.102..
  Wed Mar2 19:04:55 2016 - ok.
  Wed Mar2 19:04:55 2016 - Checking replication health on 192.168.127.103..
  Wed Mar2 19:04:55 2016 - ok.
  Wed Mar2 19:04:55 2016 - Checking master_ip_failover_script status:
  Wed Mar2 19:04:55 2016 -    /usr/local/bin/master_ip_failover --command=status --ssh_user=root --orig_master_host=192.168.127.101 --orig_master_ip=192.168.127.101 --orig_master_port=3306
  Wed Mar2 19:04:55 2016 - OK.
  Wed Mar2 19:04:55 2016 - shutdown_script is not defined.
  Wed Mar2 19:04:55 2016 - Got exit code 0 (Not master dead).
MySQL Replication Health is OK.

说明成功

6.通过脚本管理 VIP  修改master_ip_failover文件(/usr/local/bin)
  
  #!/usr/bin/env perl
  
  #Copyright (C) 2011 DeNA Co.,Ltd.
  #
  #This program is free software; you can redistribute it and/or modify
  #it under the terms of the GNU General Public License as published by
  #the Free Software Foundation; either version 2 of the License, or
  #(at your option) any later version.
  #
  #This program is distributed in the hope that it will be useful,
  #but WITHOUT ANY WARRANTY; without even the implied warranty of
  #MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.See the
  #GNU General Public License for more details.
  #
  #You should have received a copy of the GNU General Public License
  #   along with this program; if not, write to the Free Software
  #Foundation, Inc.,
  #51 Franklin Street, Fifth Floor, Boston, MA02110-1301USA
  
  ## Note: This is a sample script and is not complete. Modify the script based on your environment.
  
  use strict;
  use warnings FATAL => 'all';
  
  use Getopt::Long;
  
  my (
  $command,          $ssh_user,      $orig_master_host, $orig_master_ip,
  $orig_master_port, $new_master_host, $new_master_ip,    $new_master_port
  );
  
  my $vip='192.168.127.202/24';
  my $key="2";
  my $ssh_start_vip ="/sbin/ifconfig eth0:$key $vip";
  my $ssh_stop_vip="/sbin/ifconfig eth0:$key down";
  
  GetOptions(
  'command=s'          => \$command,
  'ssh_user=s'         => \$ssh_user,
  'orig_master_host=s' => \$orig_master_host,
  'orig_master_ip=s'   => \$orig_master_ip,
  'orig_master_port=i' => \$orig_master_port,
  'new_master_host=s'=> \$new_master_host,
  'new_master_ip=s'    => \$new_master_ip,
  'new_master_port=i'=> \$new_master_port,
  );
  
  exit &main();
  
  sub main {
  if ( $command eq "stop" || $command eq "stopssh" ) {
  
      # $orig_master_host, $orig_master_ip, $orig_master_port are passed.
      # If you manage master ip address at global catalog database,
      # invalidate orig_master_ip here.
      my $exit_code = 1;
      eval {
        
        print "Disabling the VIP on old master: $orig_master_host \n";
           &stop_vip();
        $exit_code = 0;
      };
      if ($@) {
        warn "Got Error: $@\n";
        exit $exit_code;
      }
      exit $exit_code;
  }
  elsif ( $command eq "start" ) {
  
      # all arguments are passed.
      # If you manage master ip address at global catalog database,
      # activate new_master_ip here.
      # You can also grant write access (create user, set read_only=0, etc) here.
      my $exit_code = 10;
      eval {
           print "Enabling the VIP - $vip on the new master - $new_master_host \n";
           &start_vip();
        $exit_code = 0;
      };
      if ($@) {
        warn $@;
  
        # If you want to continue failover, exit 10.
        exit $exit_code;
      }
      exit $exit_code;
  }
  elsif ( $command eq "status" ) {
      print "Checking the Status of the script.. ok \n";
      # do nothing
      exit 0;
  }
  else {
      &usage();
      exit 1;
  }
  }
  
  sub start_vip(){
           `ssh $ssh_user\@$new_master_host \ " $ssh_start_vip \"`;
  }
  
  sub stop_vip(){
        `ssh $ssh_user\@$orig_master_host \ " $ssh_stop_vip \"`;
  }
  sub usage {
  print
  "Usage: master_ip_failover --command=start|stop|stopssh|status --orig_master_host=host --orig_master_ip=ip --orig_master_port=port --new_master_host=host --new_master_ip=ip --new_master_port=port\n";
  }
  
  注意:首先启动VIP在192.168.127.101(master)上
  /sbin/ifconfig eth0:2 192.168.127.202/24
7.开启MHA Manager监控  nohup masterha_manager --conf=/etc/masterha/app1.cnf > /masterha/app1/manager.log</dev/null 2>&1 &
8.查看启动状态  # masterha_check_status --conf=/etc/masterha/app1.cnf
  app1 (pid:27237) is running(0:PING_OK), master:192.168.127.101
  
9. 查看启动日志  # tail -f /masterha/app1/app1.log
   +--192.168.127.103(192.168.127.103:3306)
  
  Wed Mar2 19:08:34 2016 - Checking master_ip_failover_script status:
  Wed Mar2 19:08:34 2016 -    /usr/local/bin/master_ip_failover --command=status --ssh_user=root --orig_master_host=192.168.127.101 --orig_master_ip=192.168.127.101 --orig_master_port=3306
  Wed Mar2 19:08:34 2016 - OK.
  Wed Mar2 19:08:34 2016 - shutdown_script is not defined.
  Wed Mar2 19:08:34 2016 - Set master ping interval 1 seconds.
  Wed Mar2 19:08:34 2016 - Set secondary check script: /usr/bin/masterha_secondary_check-s MHA -s slave02--user=root --master_host=master --master_ip=192.168.127.101 --master_port=3306 --password=123456
  Wed Mar2 19:08:34 2016 - Starting ping health check on 192.168.127.101(192.168.127.101:3306)..
  Wed Mar2 19:08:34 2016 - Ping(SELECT) succeeded, waiting until MySQL doesn't respond..
  
  查看VIP
  # ip addr
  1: lo: <LOOPBACK,UP,LOWER_UP> mtu 16436 qdisc noqueue state UNKNOWN
      link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
      inet 127.0.0.1/8 scope host lo
      inet6 ::1/128 scope host
         valid_lft forever preferred_lft forever
  2: eth0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast state UP qlen 1000
      link/ether 00:0c:29:08:25:3f brd ff:ff:ff:ff:ff:ff
      inet 192.168.127.101/24 brd 192.168.127.255 scope global eth0
      inet 192.168.127.202/24 brd 192.168.127.255 scope global secondary eth0:2
      inet6 fe80::20c:29ff:fe08:253f/64 scope link
         valid_lft forever preferred_lft forever
  3: pan0: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN
link/ether 0e:ed:39:ba:c1:1b brd ff:ff:ff:ff:ff:ff


10.测试切换
测试关闭主库
# /etc/init.d/mysql stop
Shutting down MySQL (Percona Server)......               
查看slave02复制状态:
# mysql
Welcome to the MySQL monitor.Commands end with ; or \g.
Your MySQL connection id is 27
Server version: 5.6.27-75.0-log Percona Server (GPL), Release 75.0, Revision 8bb53b6

Copyright (c) 2009-2015 Percona LLC and/or its affiliates
Copyright (c) 2000, 2015, Oracle and/or its affiliates. All rights reserved.

Oracle is a registered trademark of Oracle Corporation and/or its
affiliates. Other names may be trademarks of their respective
owners.

Type 'help;' or '\h' for help. Type '\c' to clear the current input statement.

root@(none) 07:42:08>show slave status\G;
*************************** 1. row ***************************
               Slave_IO_State: Waiting for master to send event
                  Master_Host: 192.168.127.102#已经自动切换了
                  Master_User: repl
                  Master_Port: 3306
                Connect_Retry: 60
            Master_Log_File: mysql-bin.000003
          Read_Master_Log_Pos: 981
               Relay_Log_File: mysqld-relay-bin.000002
                Relay_Log_Pos: 283
      Relay_Master_Log_File: mysql-bin.000003
             Slave_IO_Running: Yes
            Slave_SQL_Running: Yes
            Replicate_Do_DB:
          Replicate_Ignore_DB:
         Replicate_Do_Table:
       Replicate_Ignore_Table:
      Replicate_Wild_Do_Table:
Replicate_Wild_Ignore_Table:
                   Last_Errno: 0
                   Last_Error:
               Skip_Counter: 0
          Exec_Master_Log_Pos: 981
            Relay_Log_Space: 457
            Until_Condition: None
               Until_Log_File:
                Until_Log_Pos: 0
         Master_SSL_Allowed: No
         Master_SSL_CA_File:
         Master_SSL_CA_Path:
            Master_SSL_Cert:
            Master_SSL_Cipher:
               Master_SSL_Key:
      Seconds_Behind_Master: 0
Master_SSL_Verify_Server_Cert: No
                Last_IO_Errno: 0
                Last_IO_Error:
               Last_SQL_Errno: 0
               Last_SQL_Error:
Replicate_Ignore_Server_Ids:
             Master_Server_Id: 102
                  Master_UUID: 1bb38a96-e066-11e5-84cb-000c2976ee35
             Master_Info_File: /data/mysql3306/master.info
                  SQL_Delay: 0
          SQL_Remaining_Delay: NULL
      Slave_SQL_Running_State: Slave has read all relay log; waiting for the slave I/O thread to update it
         Master_Retry_Count: 86400
                  Master_Bind:
      Last_IO_Error_Timestamp:
   Last_SQL_Error_Timestamp:
               Master_SSL_Crl:
         Master_SSL_Crlpath:
         Retrieved_Gtid_Set:
            Executed_Gtid_Set:
                Auto_Position: 0
1 row in set (0.00 sec)

ERROR:
No query specified
查看VIP漂移slave01(192.168.247.102)上
# ip addr
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 16436 qdisc noqueue state UNKNOWN
    link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
    inet 127.0.0.1/8 scope host lo
    inet6 ::1/128 scope host
       valid_lft forever preferred_lft forever
2: eth0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast state UP qlen 1000
    link/ether 00:0c:29:76:ee:35 brd ff:ff:ff:ff:ff:ff
    inet 192.168.127.102/24 brd 192.168.127.255 scope global eth0
    inet 192.168.127.202/24 brd 192.168.127.255 scope global secondary eth0:2
    inet6 fe80::20c:29ff:fe76:ee35/64 scope link
       valid_lft forever preferred_lft forever
3: pan0: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN
    link/ether 1e:77:57:63:5e:b0 brd ff:ff:ff:ff:ff:ff

10. 修改宕机的Master通常情况自动切换后,原master 可能已经废弃掉,待原master 主机修改很复后,如果数据完整的情况,可能想把原master重新作为新主库的slave,这是我们就需要借助当时自动切换时刻的MHA日志来完成对原master的修复。下面是提取相关日志的命令:
  
  # grep -i 'change' /masterha/app1/app1.log
  Wed Mar2 19:09:23 2016 - All other slaves should start replication from here. Statement should be: CHANGE MASTER TO MASTER_HOST='192.168.127.102', MASTER_PORT=3306, MASTER_LOG_FILE='mysql-bin.000003', MASTER_LOG_POS=981, MASTER_USER='repl', MASTER_PASSWORD='xxx';
  Wed Mar2 19:09:23 2016 - Executed CHANGE MASTER.
11. 修复master变成从库  在master(192.168.127.101)操作如下:
  # /etc/init.d/mysql start
  Starting MySQL (Percona Server)..                        
  # mysql
  Welcome to the MySQL monitor.Commands end with ; or \g.
  Your MySQL connection id is 1
  Server version: 5.6.27-75.0-log Percona Server (GPL), Release 75.0, Revision 8bb53b6
  
  Copyright (c) 2009-2015 Percona LLC and/or its affiliates
  Copyright (c) 2000, 2015, Oracle and/or its affiliates. All rights reserved.
  
  Oracle is a registered trademark of Oracle Corporation and/or its
  affiliates. Other names may be trademarks of their respective
  owners.
  
  Type 'help;' or '\h' for help. Type '\c' to clear the current input statement.
  
  root@(none) 07:26:45>CHANGE MASTER TO MASTER_HOST='192.168.127.102', MASTER_PORT=3306, MASTER_LOG_FILE='mysql-bin.000003', MASTER_LOG_POS=981, MASTER_USER='repl', MASTER_PASSWORD='repl';
  Query OK, 0 rows affected, 2 warnings (0.06 sec)
  
  root@(none) 07:26:47>start slave;
  Query OK, 0 rows affected (0.02 sec)
  root@(none) 07:26:49>show slave status\G;
  *************************** 1. row ***************************
                 Slave_IO_State: Waiting for master to send event
                    Master_Host: 192.168.127.102
                    Master_User: repl
                    Master_Port: 3306
                  Connect_Retry: 60
              Master_Log_File: mysql-bin.000003
            Read_Master_Log_Pos: 981
                 Relay_Log_File: mysqld-relay-bin.000002
                  Relay_Log_Pos: 283
        Relay_Master_Log_File: mysql-bin.000003
               Slave_IO_Running: Yes
              Slave_SQL_Running: Yes
              Replicate_Do_DB:
            Replicate_Ignore_DB:
           Replicate_Do_Table:
         Replicate_Ignore_Table:
        Replicate_Wild_Do_Table:
  Replicate_Wild_Ignore_Table:
                     Last_Errno: 0
                     Last_Error:
                 Skip_Counter: 0
            Exec_Master_Log_Pos: 981
              Relay_Log_Space: 457
              Until_Condition: None
                 Until_Log_File:
                  Until_Log_Pos: 0
           Master_SSL_Allowed: No
           Master_SSL_CA_File:
           Master_SSL_CA_Path:
              Master_SSL_Cert:
              Master_SSL_Cipher:
                 Master_SSL_Key:
        Seconds_Behind_Master: 0
  Master_SSL_Verify_Server_Cert: No
                  Last_IO_Errno: 0
                  Last_IO_Error:
                 Last_SQL_Errno: 0
                 Last_SQL_Error:
  Replicate_Ignore_Server_Ids:
               Master_Server_Id: 102
                    Master_UUID: 1bb38a96-e066-11e5-84cb-000c2976ee35
               Master_Info_File: /data/mysql3306/master.info
                    SQL_Delay: 0
            SQL_Remaining_Delay: NULL
        Slave_SQL_Running_State: Slave has read all relay log; waiting for the slave I/O thread to update it
           Master_Retry_Count: 86400
                    Master_Bind:
        Last_IO_Error_Timestamp:
     Last_SQL_Error_Timestamp:
                 Master_SSL_Crl:
           Master_SSL_Crlpath:
           Retrieved_Gtid_Set:
              Executed_Gtid_Set:
                  Auto_Position: 0
  1 row in set (0.00 sec)
  
  ERROR:
  No query specified
12. 开启新的MHA Manager监控  # cd /etc/masterha/
  # cp app1.cnf app2.cnf
  修改配置如下,注意:红色是修改地方
  # viapp2.cnf
  
  manager_workdir=/masterha/app1
  manager_log=/masterha/app1/app1.log
  master_ip_failover_script=/usr/local/bin/master_ip_failover
  master_ip_online_change_script=/usr/local/bin/master_ip_online_change
  
  user=root
  password=123456
  ssh_user=root
  repl_user=repl
  repl_password=repl
  ping_interval=1
  remote_workdir=/tmp
  report_script=/usr/local/bin/send_report
  secondary_check_script=/usr/bin/masterha_secondary_check-s master-s slave01--user=root --master_host=slave01 --master_ip=192.168.127.102 --master_port=3306 --password=123456
  shutdown_script=""
  report_script=""
  
  
  
  hostname=192.168.127.102
  master_binlog_dir=/data/mysql3306
  candidate_master=1
  
  hostname=192.168.127.101
  master_binlog_dir=/data/mysql3306
  candidate_master=1
  check_repl_delay=0
  
  
  hostname=192.168.127.103
  master_binlog_dir=/data/mysql3306
  no_master=1
  
  查看
  # masterha_check_status --conf=/etc/masterha/app1.cnf
  app1 is stopped(2:NOT_RUNNING).
  启动新的MHA监控
  # nohup masterha_manager --conf=/etc/masterha/app2.cnf > /masterha/app1/manager.log</dev/null 2>&1 &
   2089
  查看启动状态
  # masterha_check_status --conf=/etc/masterha/app2.cnf
  app2 (pid:2089) is running(0:PING_OK), master:192.168.127.102
  
  以上测试成功,为了保证稳定,反复测试一下。
  
13.MHA+半同步复制  为了保证数据一致性采用半同步复制
  (1)Master(192.168.127.101),slave01(192.168.127.102)操作如下:
  执行安装相关插入件启动半同步复制
  INSTALL PLUGIN rpl_semi_sync_master SONAME 'semisync_master.so';
  SET GLOBAL rpl_semi_sync_master_enabled=1;
  SET GLOBAL rpl_semi_sync_master_timeout=10000;
  切换时也可能当作从库,所以也操作如下步骤
  INSTALL PLUGIN rpl_semi_sync_slave SONAME 'semisync_slave.so';
  SET GLOBAL rpl_semi_sync_slave_enabled=1;
  
  在配置文件my.cnf增加以下参数
  #############半同步###########
  rpl_semi_sync_master_enabled=1
  rpl_semi_sync_master_timeout=1000
  rpl_semi_sync_master_trace_level=32
  rpl_semi_sync_master_wait_no_slave=on
  
  rpl_semi_sync_slave_enabled=1
  #################################
  
  (2)       所以的从都操作如下:
Slave02(192.168.127.102)的操作
  执行安装相关插入件启动半同步复制
  INSTALL PLUGIN rpl_semi_sync_slave SONAME 'semisync_slave.so';
  SET GLOBAL rpl_semi_sync_slave_enabled=1;
  在配置文件my.cnf增加以下参数
  #############半同步###########
  rpl_semi_sync_slave_enabled=1
  #################################
  
  以上配置成功,不需要重启
  
  查看主库的半同步
  root@(none) 11:36:36>show variables like 'rpl%';
  +------------------------------------+----------+
  | Variable_name                      | Value    |
  +------------------------------------+----------+
  | rpl_semi_sync_master_enabled       | ON       |
  | rpl_semi_sync_master_timeout       | 10000    |
  | rpl_semi_sync_master_trace_level   | 32       |
  | rpl_semi_sync_master_wait_no_slave | ON       |
  | rpl_semi_sync_slave_enabled      | ON       |
  | rpl_semi_sync_slave_trace_level    | 32       |
  | rpl_stop_slave_timeout             | 31536000 |
  +------------------------------------+----------+
  7 rows in set (0.01 sec)
  
  查看从库的半同步
  
  root@(none) 11:36:36>show variables like 'rpl%';
  +---------------------------------+----------+
  | Variable_name                   | Value    |
  +---------------------------------+----------+
  | rpl_semi_sync_slave_enabled   | ON       |
  | rpl_semi_sync_slave_trace_level | 32       |
  | rpl_stop_slave_timeout          | 31536000 |
  +---------------------------------+----------+
  3 rows in set (0.01 sec)
  

  
页: [1]
查看完整版本: Percona-mysql MHA高可用实战方案