• Red Hat cluster fails to validate (CentOS 6.4)

    From Iain@1:0/0 to All on Tue Sep 10 10:11:01 2013
    Hi.

    I've set up a Red Hat cluster on CentOS 6.4 by doing a minimal install, yum update, then installing some generic packages to make the system more useful:

    yum install bind-utils dhcp dos2unix lsb man-pages man-pages-overrides nano nmap ntp rsync tcpdump unix2dos vim-enhanced wget

    I then install the Cluster Suite:

    yum install rgmanager ccs

    And create a config from scratch using CCS:

    ccs -h ha-01 --createcluster test-ha
    ccs -h ha-01 --addnode ha-01
    ccs -h ha-01 --addnode ha-02
    ccs -h ha-01 --addresource ip address=10.1.1.3 monitor_link=1
    ccs -h ha-01 --addresource ip address=10.1.1.4 monitor_link=1
    ccs -h ha-01 --addresource ip address=10.110.0.3 monitor_link=1
    ccs -h ha-01 --addresource ip address=10.110.8.3 monitor_link=1
    ccs -h ha-01 --addservice routing-a autostart=1 recovery=restart
    ccs -h ha-01 --addservice routing-b autostart=1 recovery=restart
    ccs -h ha-01 --addsubservice routing-a ip ref=10.1.1.3
    ccs -h ha-01 --addsubservice routing-a ip ref=10.110.0.3
    ccs -h ha-01 --addsubservice routing-b ip ref=10.1.1.4
    ccs -h ha-01 --addsubservice routing-b ip ref=10.110.8.3

    The following config results:

    <?xml version="1.0"?>
    <cluster config_version="13" name="test-ha">
    <fence_daemon/>
    <clusternodes>
    <clusternode name="ha-01" nodeid="1"/>
    <clusternode name="ha-02" nodeid="2"/>
    </clusternodes>
    <cman/>
    <fencedevices/>
    <rm>
    <failoverdomains/>
    <resources>
    <ip address="10.1.1.3" monitor_link="1"/>
    <ip address="10.1.1.4" monitor_link="1"/>
    <ip address="10.110.0.3" monitor_link="1"/>
    <ip address="10.110.8.3" monitor_link="1"/>
    </resources>
    <service autostart="1" name="routing-a" recovery="restart">
    <ip ref="10.1.1.3"/>
    <ip ref="10.110.0.3"/>
    </service>
    <service autostart="1" name="routing-b" recovery="restart">
    <ip ref="10.1.1.4"/>
    <ip ref="10.110.8.3"/>
    </service>
    </rm>
    </cluster>

    At this point, back in July, I could do a service cman start, and it would all come up. I had failover working with another box in the same configuration.

    Since August, I get the following error:

    Relax-NG validity error : Extra element rm in interleave
    tempfile:10: element rm: Relax-NG validity error : Element cluster failed to validate content
    Configuration fails to validate

    Does anyone know what's going on, please?

    Thanks,

    Iain.

    --- MBSE BBS v1.0.0 (GNU/Linux-i386)
    * Origin: The Kofo System II BBS telnet://fido2.kofobbs.