机器简介:
monitor 192.168.146.131 监控服务器
master-001 192.168.146.129 读写主机001
master-002 192.168.146.130 读写主机002
slave-001 192.168.146.131 只读主机001
虚拟IP分配
master-001 192.168.146.135 writer
master-002 192.168.146.136 reader
slave-001 192.168.146.137 reader
三台相同操作:
关闭防火墙
一,安装mysql
二,mysql 权限赋值
GRANT REPLICATION SLAVE ON *.* TO 'myrepl'@'192.168.146.%' IDENTIFIED BY '123456';
GRANT REPLICATION CLIENT ON *.* TO 'mmm_monitor'@'192.168.146.%' IDENTIFIED BY 'monitor';
GRANT SUPER, REPLICATION CLIENT, PROCESS ON *.* TO 'mmm_agent'@'192.168.146.%' IDENTIFIED BY 'agent';
二,安装MMM
安装mysql-MMM
安装依赖包:
yum install epel-release
yum install perl-Log-Log4perl-Appender-Socket-UNIX
rpm 安装
rpm -ivh mysql-mmm-2.2.1-2.el6.noarch.rpm
安装mysql-agent
安装依赖包:
yum install -y perl-Algorithm-Diff perl-Class-Singleton perl-Proc-Daemon perl-Net-ARP perl-DBD-mysql
rpm 安装
rpm -ivh mysql-mmm-agent-2.2.1-2.el6.noarch.rpm
chkconfig mysql-mmm-agent on
(131上)monitor 机器上安装monitor包
rpm -ivh mysql-mmm-monitor-2.2.1-2.el6.noarch.rpm
三.配置MMM
修改my.cnf
db1
server-id= 1
gtid_mode= on
log_slave_updates=1 #当一个主故障,另一个立即接管
enforce_gtid_consistency= 1
auto-increment-increment = 2
auto-increment-offset = 1
replicate-ignore-db=mysql,information_schema #不同步的数据库,多个写多行
db2
server-id= 2
gtid_mode= on
log_slave_updates=1 #当一个主故障,另一个立即接管
enforce_gtid_consistency= 1
auto-increment-increment = 2
auto-increment-offset = 2
replicate-ignore-db=mysql,information_schema #不同步的数据库,多个写多行
db3
server-id= 3
gtid_mode= on
log_slave_updates=1 #当一个主故障,另一个立即接管
enforce_gtid_consistency= 1
replicate-ignore-db=mysql,information_schema #不同步的数据库,多个写多行
read_only=1
配置主从
130 和131 上操作
CHANGE MASTER TO MASTER_HOST='192.168.146.129',MASTER_USER='myrepl',MASTER_PASSWORD='123456',MASTER_PORT=3306,MASTER_LOG_FILE='mysql-bin.000022', MASTER_CONNECT_RETRY=311;
start slave;
129上操作
130 和131 上操作
CHANGE MASTER TO MASTER_HOST='192.168.146.130',MASTER_USER='myrepl',MASTER_PASSWORD='123456',MASTER_PORT=3306,MASTER_LOG_FILE='mysql-bin.000032', MASTER_CONNECT_RETRY=234;
start slave;
配置/etc/mysql-mmm/mmm_common.conf #三机器统一
active_master_role writer
<host default>
cluster_interface eth0
pid_path /var/run/mysql-mmm/mmm_agentd.pid
bin_path /usr/libexec/mysql-mmm/
replication_user myrepl
replication_password 123456
agent_user mmm_agent
agent_password agent
</host>
<host db1>
ip 192.168.146.129
mode master
peer db2
</host>
<host db2>
ip 192.168.146.130
mode master
peer db1
</host>
<host db3>
ip 192.168.146.131
mode slave
</host>
<role writer>
hosts db1, db2
ips 192.168.146.135
mode exclusive
</role>
<role reader>
hosts db1, db2, db3
ips 192.168.146.136, 192.168.146.137, 192.168.146.138
mode balanced
</role>
配置/etc/mysql-mmm/mmm_agent.conf
this db1 #是谁就写谁
管理机上配置 mmm_mon.conf
include mmm_common.conf
<monitor>
ip 127.0.0.1
pid_path /var/run/mysql-mmm/mmm_mond.pid
bin_path /usr/libexec/mysql-mmm
status_path /var/lib/mysql-mmm/mmm_mond.status
ping_ips 192.168.146.129,192.168.146.130 #其他机器IP
auto_set_online 60
# The kill_host_bin does not exist by default, though the monitor will
# throw a warning about it missing. See the section 5.10 "Kill Host
# Functionality" in the PDF documentation.
#
# kill_host_bin /usr/libexec/mysql-mmm/monitor/kill_host
#
</monitor>
<host default>
monitor_user mmm_monitor
monitor_password monitor
</host>
debug 0
加入自启动:
chkconfig mysql-mmm-monitor on
chkconfig mysql-mmm-agent on
开启服务:
启动 mmm_agent
service mysql-mmm-agent start
启动 mmm_monitor
service mysql-mmm-monitor start
管理命令
[root@mysqlDB03 mysql-mmm]# mmm_control --help
Invalid command '--help'
Valid commands are:
help - show this message
ping - ping monitor
show - show status
checks [<host>|all [<check>|all]] - show checks status
set_online <host> - set host <host> online
set_offline <host> - set host <host> offline
mode - print current mode.
set_active - switch into active mode.
set_manual - switch into manual mode.
set_passive - switch into passive mode.
move_role [--force] <role> <host> - move exclusive role <role> to host <host>
(Only use --force if you know what you are doing!)
set_ip <ip> <host> - set role with ip <ip> to host <host>
[root@mysqlDB03 mysql-mmm]# mmm_control show
db1(192.168.146.129) master/ONLINE. Roles: reader(192.168.146.138), writer(192.168.146.135)
db2(192.168.146.130) master/ONLINE. Roles: reader(192.168.146.137)
db3(192.168.146.131) slave/ONLINE. Roles: reader(192.168.146.136
[root@mysqlDB03 mysql-mmm]# mmm_control checks all
db2 ping [last change: 2016/11/10 19:18:54] OK
db2 mysql [last change: 2016/11/10 19:18:54] OK
db2 rep_threads [last change: 2016/11/10 19:18:54] OK
db2 rep_backlog [last change: 2016/11/10 19:18:54] OK: Backlog is null
db3 ping [last change: 2016/11/10 19:18:54] OK
db3 mysql [last change: 2016/11/10 19:19:17] OK
db3 rep_threads [last change: 2016/11/10 19:19:17] OK
db3 rep_backlog [last change: 2016/11/10 19:19:17] OK: Backlog is null
db1 ping [last change: 2016/11/10 19:18:54] OK
db1 mysql [last change: 2016/11/10 19:18:54] OK
db1 rep_threads [last change: 2016/11/10 19:18:54] OK
db1 rep_backlog [last change: 2016/11/10 19:18:54] OK: Backlog is null
测试转换:
DB1上操作:
[root@mysqlDB01 mysql-mmm]# mysqladmin -uroot -p -S /tmp/mysql.sock shutdown
Enter password:
管理机器上查看:
Bye
[root@mysqlDB03 mysql-mmm]# mmm_control checks all
db2 ping [last change: 2016/11/10 19:18:54] OK
db2 mysql [last change: 2016/11/10 19:18:54] OK
db2 rep_threads [last change: 2016/11/10 19:18:54] OK
db2 rep_backlog [last change: 2016/11/10 19:18:54] OK: Backlog is null
db3 ping [last change: 2016/11/10 19:18:54] OK
db3 mysql [last change: 2016/11/10 19:39:20] OK
db3 rep_threads [last change: 2016/11/10 19:39:20] OK
db3 rep_backlog [last change: 2016/11/10 19:19:17] OK: Backlog is null
db1 ping [last change: 2016/11/10 19:18:54] OK
db1 mysql [last change: 2016/11/10 19:31:34] ERROR: Connect error (host = 192.168.146.129:3306, user = mmm_monitor)! Lost connection to MySQL server at 'reading initial communication packet', system error: 111
db1 rep_threads [last change: 2016/11/10 19:18:54] OK
db1 rep_backlog [last change: 2016/11/10 19:18:54] OK: Backlog is null
[root@mysqlDB03 mysql-mmm]# mmm_control show
db1(192.168.146.129) master/HARD_OFFLINE. Roles:
db2(192.168.146.130) master/ONLINE. Roles: reader(192.168.146.137), writer(192.168.146.135)
db3(192.168.146.131) slave/ONLINE. Roles: reader(192.168.146.136), reader(192.168.146.138)
启动DB1 等待60s再看:
[root@mysqlDB03 mysql-mmm]# mmm_control show
db1(192.168.146.129) master/ONLINE. Roles: reader(192.168.146.136)
db2(192.168.146.130) master/ONLINE. Roles: reader(192.168.146.137), writer(192.168.146.135)
db3(192.168.146.131) slave/ONLINE. Roles: reader(192.168.146.138)
我们想把writer角色切回到db1上:
[root@mysqlDB03 mysql-mmm]# mmm_control move_role writer db1
OK: Role 'writer' has been moved from 'db2' to 'db1'. Now you can wait some time and check new roles info!
[root@mysqlDB03 mysql-mmm]# mmm_control show
db1(192.168.146.129) master/ONLINE. Roles: reader(192.168.146.136), writer(192.168.146.135)
db2(192.168.146.130) master/ONLINE. Roles: reader(192.168.146.137)
db3(192.168.146.131) slave/ONLINE. Roles: reader(192.168.146.138)
遇到两个问题
1. 监控程序服务器无法启动
在管理服务器上,一切都完成后,通过mmm_control查看状态,得到下面的错误信息:
ERROR: Cant connect to monitor daemon! 通过编辑/etc/mysql-mmm/mmm_mon.conf文件将debug 0改为debug 1,打开监控程序的debug状态。重新启动监控程序(service mysql-mmm-monitor restart),就会看到详细的错误信息,找不到Perl Time HiRes库。执行yum -y install perl-Time-HiRes*就可以解决。
2. 防火墙问题导致Warning: agent on host db1 is not reachable.
控制台程序正确启动后,再次执行mmm_control show,却看到下面的输出:
# Warning: agent on host db1 is not reachable
# Warning: agent on host db2 is not reachable
db1(192.168.146.129) master/ONLINE. Roles:
db2(192.168.146.130) master/ONLINE. Roles:
db3(192.168.146.131) slave/ONLINE. Roles: reader(192.168.146.138)
再次打开debug,查看错误信息
通过telnet 192.168.146.130 9989下面检查网络连接,得到了No route to host的错误信息。登录db1,关闭iptables(这不是一个好主意)。同样,关闭db2、db3上的防火墙,再次重启监控程序,一切回到正常状态!