Posted by mengfei888 on 2015-07-02 16:37:24

Ceph three-node installation and configuration


Ceph installation and configuration:

Because the environment below is set up on the three nodes that already run OpenStack, the hostnames are controller/compute/network.
For the Ceph configuration you can still add names such as mon, mds, osd and client to /etc/hosts.

Note: using hostnames such as mon0/osd0/osd1 led to "hostname does not match remote hostname" complaints later in the deployment. OSD IDs start at 0, so host naming could start at osd0;
    instead, the original OpenStack hostnames are kept, with mon0=compute, osd0=controller, osd1=network.


(I) Environment preparation
1. Node IPs
192.168.128.100 (hostname controller, with one partition /dev/sdc1 provided to an OSD)
192.168.128.102 (hostname network, with one partition /dev/sdc1 provided to an OSD)
192.168.128.101 (hostname compute, with one partition /dev/sdc1 provided to an OSD)

2. Edit /etc/hosts on all hosts
   
#controller
192.168.128.100 controller swift1 osd0
#compute
192.168.128.101 compute mon0 osd2 client
#network
192.168.128.102 network swift2 osd1

3. Create a ceph user on all nodes (note: a ceph user was created on the hosts, but for easier management I decided to keep using the mengfei user)
sudo useradd -d /home/ceph -m ceph
sudo passwd ceph

4. Grant the user root privileges on every Ceph node
echo "ceph ALL = (root) NOPASSWD:ALL" | sudo tee /etc/sudoers.d/ceph
sudo chmod 0440 /etc/sudoers.d/ceph

echo "mengfei ALL = (root) NOPASSWD:ALL" | sudo tee /etc/sudoers.d/mengfei
sudo chmod 0440 /etc/sudoers.d/mengfei

    Fix for "permission denied (publickey)":
       Even after changing the root password, root logins by password were still refused. The SSH "permission denied (publickey)" problem was resolved as follows:
    Edit /etc/ssh/sshd_config:
      change PermitRootLogin no to yes
      keep PubkeyAuthentication yes (note: do not change this to no, otherwise key authentication stops working)
      comment out the AuthorizedKeysFile .ssh/authorized_keys line with a leading #
      comment out the PasswordAuthentication no line
    Restart the SSH daemon afterwards: service ssh restart   (on Ubuntu the service is named ssh)
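
    As a minimal sketch, the relevant lines of /etc/ssh/sshd_config after these edits would read as follows (illustrative values only; the rest of the file stays unchanged):

    PermitRootLogin yes
    PubkeyAuthentication yes
    #AuthorizedKeysFile .ssh/authorized_keys
    #PasswordAuthentication no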


5. Configure passwordless SSH from the ceph-deploy admin node to every Ceph node
(1) Install an SSH server on every Ceph node
apt-get install openssh-server -y
   
(2) Configure passwordless SSH access from the compute admin node to every Ceph node. (If different users are used, generate a key under each user.)
   root@compute:~/.ssh# ssh-keygen
   Generating public/private rsa key pair.
   Enter file in which to save the key (/root/.ssh/id_rsa):
   Enter passphrase (empty for no passphrase):
   Enter same passphrase again:
   Your identification has been saved in /root/.ssh/id_rsa.
   Your public key has been saved in /root/.ssh/id_rsa.pub.
   The key fingerprint is:
   25:6e:7c:8f:ea:4b:f1:a5:e6:f5:e7:30:fe:79:bf:08 root@compute
   The key's randomart image is:
   +--[ RSA 2048]----+
   |               |
   |               |
   |      . .      |
   |       o o       |
   |      S . .    |
   |       . + =   |
   |      . =Eo o|
   |       . + ..o.o+|
   |       .+...o*B|
   +-----------------+
   root@compute:~/.ssh#

(3) Copy the mon node's key to every Ceph node (the user name to use must be specified)
ssh-copy-id mengfei@compute
ssh-copy-id mengfei@controller
ssh-copy-id mengfei@network

ssh-copy-id root@compute
ssh-copy-id root@controller
ssh-copy-id root@network


   root@compute:~/.ssh# ssh-copy-id mengfei@controller
   /usr/bin/ssh-copy-id: INFO: attempting to log in with the new key(s), to filter out any that are already installed
   /usr/bin/ssh-copy-id: INFO: 1 key(s) remain to be installed -- if you are prompted now it is to install the new keys
   mengfei@controller's password:
   
   Number of key(s) added: 1
   
   Now try logging into the machine, with:   "ssh 'mengfei@controller'"
   and check to make sure that only the key(s) you wanted were added.
   
   root@compute:~/.ssh#


(4) Test that each Ceph node can be logged into without a password
ssh mengfei@controller
ssh mengfei@network
ssh root@controller
ssh root@network

   root@compute:~/.ssh# ssh mengfei@controller                     
   Welcome to Ubuntu 14.04.1 LTS (GNU/Linux 3.13.0-39-generic i686)
                                                                     
      * Documentation:https://help.ubuntu.com/                     
                                                                     
   19 packages can be updated.                                    
   15 updates are security updates.                              
                                                                     
   Last login: Tue Nov 25 14:07:06 2014 from compute               
   mengfei@controller:~$

(5)(Recommended) Modify the ~/.ssh/config file of your ceph-deploy admin node
   so that ceph-deploy can log in to Ceph nodes as the user you created without requiring you to specify --username {username} each time you execute ceph-deploy. This has the added benefit of streamlining ssh and scp usage. Replace {username} with the user name you created:
   Note: this was not added in this example; the config file does not seem to exist by default.
Host controller
   Hostname controller
   User mengfei
Host network
   Hostname network
   User mengfei


(II) Install ceph-deploy
1. Add the release key
wget -q -O- 'https://ceph.com/git/?p=ceph.git;a=blob_plain;f=keys/release.asc' | sudo apt-key add -

2. Add the Ceph packages to your repository, replacing {ceph-stable-release} with a stable Ceph release (e.g. cuttlefish, dumpling, etc.)
   Template: echo deb http://ceph.com/debian-{ceph-stable-release}/ $(lsb_release -sc) main | sudo tee /etc/apt/sources.list.d/ceph.list

echo deb http://ceph.com/debian-dumpling/ $(lsb_release -sc) main | sudo tee /etc/apt/sources.list.d/ceph.list

3. Update the package sources and install ceph-deploy
apt-get update
apt-get install ceph-deploy

4. Install NTP (omitted)


(III) Install and configure the Ceph cluster

1. For best results, create a directory on your admin node for maintaining the configuration files and keys that ceph-deploy generates for the cluster.
mkdir my-cluster
mkdir /etc/ceph
cd my-cluster


2. Create a cluster

(1) If you are starting over, first purge any previous Ceph data and keys from the nodes, so that the new cluster's filesystem ID (fsid) and monitor keys are generated from a clean state:
ceph-deploy purgedata compute controller network
ceph-deploy forgetkeys

   root@compute:/home/mengfei/my-cluster# ceph-deploy purgedata compute controller network
    found configuration file at: /root/.cephdeploy.conf
    Invoked (1.5.20): /usr/bin/ceph-deploy purgedata compute controller network
    Purging data from cluster ceph hosts compute controller network
    connected to host: compute
    detect platform information from remote host
    detect machine type
    find the location of an executable
    connected to host: controller
    detect platform information from remote host
    detect machine type
    find the location of an executable
    connected to host: network
    detect platform information from remote host
    detect machine type
    find the location of an executable
    connected to host: compute
    detect platform information from remote host
    detect machine type
    Distro info: Ubuntu 14.04 trusty
    purging data on compute
    Running command: rm -rf --one-file-system -- /var/lib/ceph
    Running command: rm -rf --one-file-system -- /etc/ceph/
    connected to host: controller
    detect platform information from remote host
    detect machine type
    Distro info: Ubuntu 14.04 trusty
    purging data on controller
    Running command: rm -rf --one-file-system -- /var/lib/ceph
    Running command: rm -rf --one-file-system -- /etc/ceph/
    connected to host: network
    detect platform information from remote host
    detect machine type
    Distro info: Ubuntu 14.04 trusty
    purging data on network
    Running command: rm -rf --one-file-system -- /var/lib/ceph
    Running command: rm -rf --one-file-system -- /etc/ceph/
   root@compute:/home/mengfei/my-cluster#
         
   root@compute:/home/mengfei/my-cluster# ceph-deploy forgetkeys
    found configuration file at: /root/.cephdeploy.conf
    Invoked (1.5.20): /usr/bin/ceph-deploy forgetkeys
   root@compute:/home/mengfei/my-cluster#



(2) From the admin node, create the cluster with ceph-deploy
    Note: ceph.conf, ceph.mon.keyring and a log file (the configuration file, monitor keyring and log) are generated in the current directory.
cd /home/mengfei/my-cluster
ceph-deploy new compute

    (Note: the cluster should be created on the primary node first, i.e. run ceph-deploy new compute first and run install afterwards.)

       root@compute:/home/mengfei/my-cluster# ceph-deploy new compute
       found configuration file at: /root/.cephdeploy.conf
       Invoked (1.5.20): /usr/bin/ceph-deploy new compute
       Creating new cluster named ceph
       making sure passwordless SSH succeeds
       connected to host: compute
       detect platform information from remote host
       detect machine type
       find the location of an executable
       Running command: /bin/ip link show
       Running command: /bin/ip addr show
       IP addresses found: ['192.168.122.1', '192.168.128.101', '10.10.10.101']
       Resolving host compute
       Monitor compute at 192.168.128.101
       Monitor initial members are ['compute']
       Monitor addrs are ['192.168.128.101']
       Creating a random mon key...
       Writing monitor keyring to ceph.mon.keyring...
       Writing initial config to ceph.conf...
       root@compute:/home/mengfei/my-cluster#


(3) Install Ceph
ceph-deploy install compute controller network

ceph-deploy uninstall compute controller network    (if you need to reinstall, these two commands remove Ceph)
apt-get remove --purge ceph ceph-common ceph-mds
      
      root@compute:/home/mengfei/my-cluster# ceph-deploy install compute controller network
       found configuration file at: /root/.cephdeploy.conf
       Invoked (1.5.20): /usr/bin/ceph-deploy install compute controller network
       Installing stable version firefly on cluster ceph hosts compute controller network
       Detecting platform for host compute ...
       connected to host: compute
       detect platform information from remote host
       detect machine type
       Distro info: Ubuntu 14.04 trusty
       installing ceph on compute
       Running command: env DEBIAN_FRONTEND=noninteractive apt-get -q install --assume-yes ca-certificates
       Reading package lists...
       Building dependency tree...
       Reading state information...
       ca-certificates is already the newest version.
       0 upgraded, 0 newly installed, 0 to remove and 48 not upgraded.
       Running command: wget -O release.asc https://ceph.com/git/?p=ceph.git;a=blob_plain;f=keys/release.asc
       --2014-11-26 17:36:27--https://ceph.com/git/?p=ceph.git;a=blob_plain;f=keys/release.asc
       Resolving ceph.com (ceph.com)... 208.113.241.137, 2607:f298:4:147::b05:fe2a
       Connecting to ceph.com (ceph.com)|208.113.241.137|:443... connected.
       HTTP request sent, awaiting response... 200 OK
       Length: unspecified
       Saving to: ‘release.asc’
      
            0K .                                                      19.7M=0s
      
       2014-11-26 17:36:28 (19.7 MB/s) - ‘release.asc’ saved
      
       Running command: apt-key add release.asc
       OK
       add deb repo to sources.list
       Running command: apt-get -q update
       Ign http://cn.archive.ubuntu.com trusty InRelease
       Hit http://ceph.com trusty InRelease
       Ign http://cn.archive.ubuntu.com trusty-updates InRelease
       Ign http://ubuntu-cloud.archive.canonical.com trusty-updates/juno InRelease
       Ign http://cn.archive.ubuntu.com trusty-backports InRelease
       Hit http://ceph.com trusty/main i386 Packages
       Hit http://ubuntu-cloud.archive.canonical.com trusty-updates/juno Release.gpg
       Ign http://cn.archive.ubuntu.com trusty-security InRelease
       Hit http://ubuntu-cloud.archive.canonical.com trusty-updates/juno Release
       .................. (a long list of package repository URLs omitted)
       Ign http://cn.archive.ubuntu.com trusty/universe Translation-en_US
       Reading package lists...
       Running command: env DEBIAN_FRONTEND=noninteractive DEBIAN_PRIORITY=critical apt-get -q -o Dpkg::Options::=--force-confnew --no-install-recommends --assume-yes install -- ceph ceph-mds ceph-common ceph-fs-common gdisk
       Reading package lists...
       Building dependency tree...
       Reading state information...
       gdisk is already the newest version.
       ceph is already the newest version.
       ceph-common is already the newest version.
       ceph-fs-common is already the newest version.
       ceph-mds is already the newest version.
       0 upgraded, 0 newly installed, 0 to remove and 48 not upgraded.
       Running command: ceph --version
       ceph version 0.80.7 (6c0127fcb58008793d3c8b62d925bc91963672a3)
       Detecting platform for host controller ...
       connected to host: controller
       detect platform information from remote host
       detect machine type
       Distro info: Ubuntu 14.04 trusty
       installing ceph on controller
       Running command: env DEBIAN_FRONTEND=noninteractive apt-get -q install --assume-yes ca-certificates
       Reading package lists...
       Building dependency tree...
       Reading state information...
       ca-certificates is already the newest version.
       0 upgraded, 0 newly installed, 0 to remove and 17 not upgraded.
       Running command: wget -O release.asc https://ceph.com/git/?p=ceph.git;a=blob_plain;f=keys/release.asc
       --2014-11-26 17:37:17--https://ceph.com/git/?p=ceph.git;a=blob_plain;f=keys/release.asc
       Resolving ceph.com (ceph.com)... 208.113.241.137, 2607:f298:4:147::b05:fe2a
       Connecting to ceph.com (ceph.com)|208.113.241.137|:443... connected.
       HTTP request sent, awaiting response... 200 OK
       Length: unspecified
       Saving to: ‘release.asc’
      
            0K .                                                      34.4M=0s
      
       2014-11-26 17:37:19 (34.4 MB/s) - ‘release.asc’ saved
      
       Running command: apt-key add release.asc
       OK
       add deb repo to sources.list
       Running command: apt-get -q update
       Hit http://ceph.com trusty InRelease
       Ign http://downloads-distro.mongodb.org dist InRelease
       .................. (a long list of package repository URLs omitted)
       Ign http://cn.archive.ubuntu.com trusty/restricted Translation-en_US
       Ign http://cn.archive.ubuntu.com trusty/universe Translation-en_US
       Fetched 361 kB in 27s (13.1 kB/s)
       Reading package lists...
       W: GPG error: http://downloads-distro.mongodb.org dist Release: The following signatures couldn't be verified because the public key is not available: NO_PUBKEY 9ECBEC467F0CEB10
       Running command: env DEBIAN_FRONTEND=noninteractive DEBIAN_PRIORITY=critical apt-get -q -o Dpkg::Options::=--force-confnew --no-install-recommends --assume-yes install -- ceph ceph-mds ceph-common ceph-fs-common gdisk
       Reading package lists...
       Building dependency tree...
       Reading state information...
       gdisk is already the newest version.
       ceph is already the newest version.
       ceph-common is already the newest version.
       ceph-fs-common is already the newest version.
       ceph-mds is already the newest version.
       0 upgraded, 0 newly installed, 0 to remove and 17 not upgraded.
       Running command: ceph --version
       ceph version 0.80.7 (6c0127fcb58008793d3c8b62d925bc91963672a3)
       Detecting platform for host network ...
       connected to host: network
       detect platform information from remote host
       detect machine type
       Distro info: Ubuntu 14.04 trusty
       installing ceph on network
       Running command: env DEBIAN_FRONTEND=noninteractive apt-get -q install --assume-yes ca-certificates
       Reading package lists...
       Building dependency tree...
       Reading state information...
       ca-certificates is already the newest version.
       0 upgraded, 0 newly installed, 0 to remove and 36 not upgraded.
       Running command: wget -O release.asc https://ceph.com/git/?p=ceph.git;a=blob_plain;f=keys/release.asc
       --2014-11-26 17:37:52--https://ceph.com/git/?p=ceph.git;a=blob_plain;f=keys/release.asc
       Resolving ceph.com (ceph.com)... 208.113.241.137, 2607:f298:4:147::b05:fe2a
       Connecting to ceph.com (ceph.com)|208.113.241.137|:443... connected.
       HTTP request sent, awaiting response... 200 OK
       Length: unspecified
       Saving to: ‘release.asc’
      
            0K .                                                      16.1M=0s
      
       2014-11-26 17:38:02 (16.1 MB/s) - ‘release.asc’ saved
      
       Running command: apt-key add release.asc
       OK
       add deb repo to sources.list
       Running command: apt-get -q update
       Hit http://ceph.com trusty InRelease
       Ign http://security.ubuntu.com trusty-security InRelease
       .................. (a long list of package repository URLs omitted)
       Ign http://cn.archive.ubuntu.com trusty/restricted Translation-en_US
       Ign http://cn.archive.ubuntu.com trusty/universe Translation-en_US
       Fetched 361 kB in 41s (8,704 B/s)
       Reading package lists...
       W: Duplicate sources.list entry http://cn.archive.ubuntu.com/ubuntu/ trusty/main i386 Packages (/var/lib/apt/lists/cn.archive.ubuntu.com_ubuntu_dists_trusty_main_binary-i386_Packages)
       Running command: env DEBIAN_FRONTEND=noninteractive DEBIAN_PRIORITY=critical apt-get -q -o Dpkg::Options::=--force-confnew --no-install-recommends --assume-yes install -- ceph ceph-mds ceph-common ceph-fs-common gdisk
       Reading package lists...
       Building dependency tree...
       Reading state information...
       gdisk is already the newest version.
       ceph is already the newest version.
       ceph-common is already the newest version.
       ceph-fs-common is already the newest version.
       ceph-mds is already the newest version.
       0 upgraded, 0 newly installed, 0 to remove and 36 not upgraded.
       Running command: ceph --version
       ceph version 0.80.7 (6c0127fcb58008793d3c8b62d925bc91963672a3)
      root@compute:/home/mengfei/my-cluster#

(4) Add a Ceph cluster monitor
ceph-deploy mon create compute

      root@compute:/home/mengfei/my-cluster# ceph-deploy mon create compute
       found configuration file at: /root/.cephdeploy.conf
       Invoked (1.5.20): /usr/bin/ceph-deploy mon create compute
       keyring (ceph.mon.keyring) not found, creating a new one
       Creating a random mon key...
       Writing monitor keyring to ceph.mon.keyring...
       Deploying mon, cluster ceph hosts compute
       detecting platform for host compute ...
       connected to host: compute
       detect platform information from remote host
       detect machine type
       distro info: Ubuntu 14.04 trusty
       determining if provided host has same hostname in remote
       get remote short hostname
       deploying mon to compute
       get remote short hostname
       remote hostname: compute
       write cluster configuration to /etc/ceph/{cluster}.conf
       create the mon path if it does not exist
       checking for done path: /var/lib/ceph/mon/ceph-compute/done
       done path does not exist: /var/lib/ceph/mon/ceph-compute/done
       creating keyring file: /var/lib/ceph/tmp/ceph-compute.mon.keyring
       create the monitor keyring file
       Running command: ceph-mon --cluster ceph --mkfs -i compute --keyring /var/lib/ceph/tmp/ceph-compute.mon.keyring
       ceph-mon: mon.noname-a 192.168.128.101:6789/0 is local, renaming to mon.compute
       ceph-mon: set fsid to a15c8476-cd50-4609-bfc7-bc49a5d24f8c
       ceph-mon: created monfs at /var/lib/ceph/mon/ceph-compute for mon.compute
       unlinking keyring file /var/lib/ceph/tmp/ceph-compute.mon.keyring
       create a done file to avoid re-doing the mon deployment
       create the init path if it does not exist
       locating the `service` executable...
       Running command: initctl emit ceph-mon cluster=ceph id=compute
       Running command: ceph --cluster=ceph --admin-daemon /var/run/ceph/ceph-mon.compute.asok mon_status
       ********************************************************************************
       status for monitor: mon.compute
       {
         "election_epoch": 2,
         "extra_probe_peers": [],
         "monmap": {
         "created": "0.000000",
         "epoch": 1,
         "fsid": "a15c8476-cd50-4609-bfc7-bc49a5d24f8c",
         "modified": "0.000000",
         "mons": [
             {
               "addr": "192.168.128.101:6789/0",
               "name": "compute",
               "rank": 0
             }
         ]
         },
         "name": "compute",
         "outside_quorum": [],
         "quorum": [
         0
         ],
         "rank": 0,
         "state": "leader",
         "sync_provider": []
       }
       ********************************************************************************
       monitor: mon.compute is running
       Running command: ceph --cluster=ceph --admin-daemon /var/run/ceph/ceph-mon.compute.asok mon_status
      root@compute:/home/mengfei/my-cluster#


(5) Gather the keys
ceph-deploy gatherkeys compute
   
   Once the keys have been gathered, the following keyring files appear in the local directory:
   1. {cluster-name}.client.admin.keyring
   2. {cluster-name}.bootstrap-osd.keyring
   3. {cluster-name}.bootstrap-mds.keyring

   root@compute:/home/mengfei/my-cluster# ceph-deploy gatherkeys compute
    found configuration file at: /root/.cephdeploy.conf
    Invoked (1.5.20): /usr/bin/ceph-deploy gatherkeys compute
    Checking compute for /etc/ceph/ceph.client.admin.keyring
    connected to host: compute
    detect platform information from remote host
    detect machine type
    fetch remote file
    Got ceph.client.admin.keyring key from compute.
    Have ceph.mon.keyring
    Checking compute for /var/lib/ceph/bootstrap-osd/ceph.keyring
    connected to host: compute
    detect platform information from remote host
    detect machine type
    fetch remote file
    Got ceph.bootstrap-osd.keyring key from compute.
    Checking compute for /var/lib/ceph/bootstrap-mds/ceph.keyring
    connected to host: compute
    detect platform information from remote host
    detect machine type
    fetch remote file
    Got ceph.bootstrap-mds.keyring key from compute.
   root@compute:/home/mengfei/my-cluster#


(6) Create the OSD directories and mount points
    Note: the disk is 5 GB; only 1 GB is partitioned here, and the remaining space is kept for other uses.

ssh root@controller   (this node is osd0)

Create the disk partition
fdisk /dev/sdc   (note: the output is recorded below)

Create the mount point
mkdir -p /var/lib/ceph/osd/ceph-osd0

Format the partition (xfs or btrfs is recommended; use mkfs):
mkfs.xfs -f /dev/sdc1   
mount /dev/sdc1 /var/lib/ceph/osd/ceph-osd0               (note: adding -o user_xattr here fails with "bad option")
mount -o remount,user_xattr /var/lib/ceph/osd/ceph-osd0   (note: add the user_xattr option via a remount; a remount does not require fully unmounting the filesystem)
vi /etc/fstab
/dev/sdc1 /var/lib/ceph/osd/ceph-osd0 xfs defaults 0 0    (note: added by hand; the official docs do not include this step)
/dev/sdc1 /var/lib/ceph/osd/ceph-osd0 xfs remount,user_xattr 0 0

      root@controller:/home/mengfei#fdisk /dev/sdc
      Command (m for help): n
      Partition type:
         p   primary (0 primary, 0 extended, 4 free)
         e   extended
      Select (default p): p
      Partition number (1-4, default 1): 1
      First sector (2048-10485759, default 2048):
      Using default value 2048
      Last sector, +sectors or +size{K,M,G} (2048-10485759, default 10485759): 2097151
      Command (m for help): p
      Device Boot      Start         End      Blocks   IdSystem
      /dev/sdc1            2048   2097151   1047552   83Linux
      Command (m for help): w
      The partition table has been altered!
      Calling ioctl() to re-read partition table.
      Syncing disks.
      root@controller:/home/mengfei#
      
      root@controller:/home/mengfei# mkfs.xfs -f /dev/sdc1
      meta-data=/dev/sdc1            isize=256    agcount=4, agsize=65472 blks
               =                     sectsz=512   attr=2, projid32bit=0
      data   =                     bsize=4096   blocks=261888, imaxpct=25
               =                     sunit=0      swidth=0 blks
      naming   =version 2            bsize=4096   ascii-ci=0
      log      =internal log         bsize=4096   blocks=1200, version=2
               =                     sectsz=512   sunit=0 blks, lazy-count=1
      realtime =none                   extsz=4096   blocks=0, rtextents=0
      root@controller:/home/mengfei#
      
      
      
ssh root@network      (this node is osd1)

Create the disk partition
fdisk /dev/sdc   (note: the output is recorded below)

Create the mount point
mkdir -p /var/lib/ceph/osd/ceph-osd1

Format the partition (xfs or btrfs is recommended; use mkfs):
mkfs.xfs -f /dev/sdc1   
mount /dev/sdc1 /var/lib/ceph/osd/ceph-osd1               (note: adding -o user_xattr here fails with "bad option")
mount -o remount,user_xattr /var/lib/ceph/osd/ceph-osd1   (note: add the user_xattr option via a remount; a remount does not require fully unmounting the filesystem)
vi /etc/fstab
#/dev/sdc1 /var/lib/ceph/osd/ceph-osd1 xfs user_xattr 0 0    (note: added by hand; the official docs do not include this step)
/dev/sdc1 /var/lib/ceph/osd/ceph-osd1 xfs rw 0 0

      root@network:/home/mengfei#fdisk /dev/sdc
      Command (m for help): n
      Partition type:
         p   primary (0 primary, 0 extended, 4 free)
         e   extended
      Select (default p): p
      Partition number (1-4, default 1): 1
      First sector (2048-10485759, default 2048):
      Using default value 2048
      Last sector, +sectors or +size{K,M,G} (2048-10485759, default 10485759): 2097151
      Command (m for help): p
      Disk /dev/sdc: 5368 MB, 5368709120 bytes
      255 heads, 63 sectors/track, 652 cylinders, total 10485760 sectors
      Units = sectors of 1 * 512 = 512 bytes
      Sector size (logical/physical): 512 bytes / 512 bytes
      I/O size (minimum/optimal): 512 bytes / 512 bytes
      Disk identifier: 0xfbd1ab98
      
      Device Boot      Start         End      Blocks   IdSystem
      /dev/sdc1            2048   2097151   1047552   83Linux
      
      Command (m for help): w
      The partition table has been altered!
      
      Calling ioctl() to re-read partition table.
      Syncing disks.
      root@network:/home/mengfei#
      
      root@network:/home/mengfei# mkfs.xfs -f /dev/sdc1
      meta-data=/dev/sdc1            isize=256    agcount=4, agsize=65472 blks
               =                     sectsz=512   attr=2, projid32bit=0
      data   =                     bsize=4096   blocks=261888, imaxpct=25
               =                     sunit=0      swidth=0 blks
      naming   =version 2            bsize=4096   ascii-ci=0
      log      =internal log         bsize=4096   blocks=1200, version=2
               =                     sectsz=512   sunit=0 blks, lazy-count=1
      realtime =none                   extsz=4096   blocks=0, rtextents=0
      root@network:/home/mengfei#



(7) From the admin node, add the OSD nodes and activate the OSDs

cd /home/mengfei/my-cluster      

    Note: be sure to run the commands from this directory, because ceph.conf was generated here when the cluster was created and ceph-deploy distributes it automatically; running them elsewhere results in a "Cannot load config" error.
    Some settings need to be changed in my-cluster/ceph.conf. For example, the OSD journal (ceph-osd0/journal) can be very large by default, so I made the following changes in my-cluster/ceph.conf (see the sketch after this list):
   
    osd journal size = 100           (journal size of 100 MB; with a large enough mount point the default is fine for a quick install, but my space is small, so I set it to 100)
    osd pool default size = 3        (number of replicas for stored objects, i.e. the object plus its copies)
    osd pool default min_size = 1    (minimum number of replicas for stored objects)
    osd crush chooseleaf type = 1    (bucket type used by chooseleaf in the CRUSH rule, given by ordinal rank rather than by name; the default is 1)
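
    As a sketch, and assuming the usual placement in the [global] section, the additions in my-cluster/ceph.conf sit alongside the entries that ceph-deploy new already generated:

    [global]
    # existing fsid / mon_initial_members / mon_host lines stay as generated
    osd journal size = 100
    osd pool default size = 3
    osd pool default min_size = 1
    osd crush chooseleaf type = 1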
            
ceph-deploy osd prepare controller:/var/lib/ceph/osd/ceph-osd0
ceph-deploy osd prepare network:/var/lib/ceph/osd/ceph-osd1
ceph-deploy osd activate controller:/var/lib/ceph/osd/ceph-osd0
ceph-deploy osd activate network:/var/lib/ceph/osd/ceph-osd1
      Note: these commands sometimes require the --overwrite-conf option.
   
   root@compute:/home/mengfei/my-cluster# ceph-deploy osd prepare controller:/var/lib/ceph/osd/ceph-osd0
    found configuration file at: /root/.cephdeploy.conf
    Invoked (1.5.20): /usr/bin/ceph-deploy --overwrite-conf osd prepare controller:/var/lib/ceph/osd/ceph-osd0
    Preparing cluster ceph disks controller:/var/lib/ceph/osd/ceph-osd0:
    connected to host: controller
    detect platform information from remote host
    detect machine type
    Distro info: Ubuntu 14.04 trusty
    Deploying osd to controller
    write cluster configuration to /etc/ceph/{cluster}.conf
    Running command: udevadm trigger --subsystem-match=block --action=add
    Preparing host controller disk /var/lib/ceph/osd/ceph-osd0 journal None activate False
    Running command: ceph-disk -v prepare --fs-type xfs --cluster ceph -- /var/lib/ceph/osd/ceph-osd0
    INFO:ceph-disk:Running command: /usr/bin/ceph-osd --cluster=ceph --show-config-value=fsid
    INFO:ceph-disk:Running command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup osd_mkfs_options_xfs
    INFO:ceph-disk:Running command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup osd_fs_mkfs_options_xfs
    INFO:ceph-disk:Running command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup osd_mount_options_xfs
    INFO:ceph-disk:Running command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup osd_fs_mount_options_xfs
    INFO:ceph-disk:Running command: /usr/bin/ceph-osd --cluster=ceph --show-config-value=osd_journal_size
    DEBUG:ceph-disk:Preparing osd data dir /var/lib/ceph/osd/ceph-osd0
    checking OSD status...
    Running command: ceph --cluster=ceph osd stat --format=json
    Host controller is now ready for osd use.
   root@compute:/home/mengfei/my-cluster#
   
   root@compute:/home/mengfei/my-cluster# ceph-deploy osd activate controller:/var/lib/ceph/osd/ceph-osd0
    found configuration file at: /root/.cephdeploy.conf
    Invoked (1.5.20): /usr/bin/ceph-deploy osd activate controller:/var/lib/ceph/osd/ceph-osd0
    Activating cluster ceph disks controller:/var/lib/ceph/osd/ceph-osd0:
    connected to host: controller
    detect platform information from remote host
    detect machine type
    Distro info: Ubuntu 14.04 trusty
    activating host controller disk /var/lib/ceph/osd/ceph-osd0
    will use init type: upstart
    Running command: ceph-disk -v activate --mark-init upstart --mount /var/lib/ceph/osd/ceph-osd0
    DEBUG:ceph-disk:Cluster uuid is a15c8476-cd50-4609-bfc7-bc49a5d24f8c
    INFO:ceph-disk:Running command: /usr/bin/ceph-osd --cluster=ceph --show-config-value=fsid
    DEBUG:ceph-disk:Cluster name is ceph
    DEBUG:ceph-disk:OSD uuid is ec6d0ec3-9c44-4bec-80cb-24709ec03ea1
    DEBUG:ceph-disk:Allocating OSD id...
    INFO:ceph-disk:Running command: /usr/bin/ceph --cluster ceph --name client.bootstrap-osd --keyring /var/lib/ceph/bootstrap-osd/ceph.keyring osd create --concise ec6d0ec3-9c44-4bec-80cb-24709ec03ea1
    DEBUG:ceph-disk:OSD id is 4
    DEBUG:ceph-disk:Initializing OSD...
    INFO:ceph-disk:Running command: /usr/bin/ceph --cluster ceph --name client.bootstrap-osd --keyring /var/lib/ceph/bootstrap-osd/ceph.keyring mon getmap -o /var/lib/ceph/osd/ceph-osd0/activate.monmap
    got monmap epoch 1
    INFO:ceph-disk:Running command: /usr/bin/ceph-osd --cluster ceph --mkfs --mkkey -i 4 --monmap /var/lib/ceph/osd/ceph-osd0/activate.monmap --osd-data /var/lib/ceph/osd/ceph-osd0 --osd-journal /var/lib/ceph/osd/ceph-osd0/journal --osd-uuid ec6d0ec3-9c44-4bec-80cb-24709ec03ea1 --keyring /var/lib/ceph/osd/ceph-osd0/keyring
    2014-11-27 15:23:41.011243 b684b740 -1 journal FileJournal::_open: disabling aio for non-block journal.Use journal_force_aio to force use of aio anyway
    2014-11-27 15:23:41.261893 b684b740 -1 journal FileJournal::_open: disabling aio for non-block journal.Use journal_force_aio to force use of aio anyway
    2014-11-27 15:23:41.291819 b684b740 -1 filestore(/var/lib/ceph/osd/ceph-osd0) could not find 23c2fcde/osd_superblock/0//-1 in index: (2) No such file or directory
    2014-11-27 15:23:41.300771 b684b740 -1 created object store /var/lib/ceph/osd/ceph-osd0 journal /var/lib/ceph/osd/ceph-osd0/journal for osd.4 fsid a15c8476-cd50-4609-bfc7-bc49a5d24f8c
    2014-11-27 15:23:41.300858 b684b740 -1 auth: error reading file: /var/lib/ceph/osd/ceph-osd0/keyring: can't open /var/lib/ceph/osd/ceph-osd0/keyring: (2) No such file or directory
    2014-11-27 15:23:41.301001 b684b740 -1 created new key in keyring /var/lib/ceph/osd/ceph-osd0/keyring
    DEBUG:ceph-disk:Marking with init system upstart
    DEBUG:ceph-disk:Authorizing OSD key...
    INFO:ceph-disk:Running command: /usr/bin/ceph --cluster ceph --name client.bootstrap-osd --keyring /var/lib/ceph/bootstrap-osd/ceph.keyring auth add osd.4 -i /var/lib/ceph/osd/ceph-osd0/keyring osd allow * mon allow profile osd
    added key for osd.4
    DEBUG:ceph-disk:ceph osd.4 data dir is ready at /var/lib/ceph/osd/ceph-osd0
    DEBUG:ceph-disk:Creating symlink /var/lib/ceph/osd/ceph-4 -> /var/lib/ceph/osd/ceph-osd0
    DEBUG:ceph-disk:Starting ceph osd.4...
    INFO:ceph-disk:Running command: /sbin/initctl emit --no-wait -- ceph-osd cluster=ceph id=4
    checking OSD status...
    Running command: ceph --cluster=ceph osd stat --format=json
   root@compute:/home/mengfei/my-cluster#
            
   root@compute:/home/mengfei/my-cluster# ceph-deploy osd prepare network:/var/lib/ceph/osd/ceph-osd1
    found configuration file at: /root/.cephdeploy.conf
    Invoked (1.5.20): /usr/bin/ceph-deploy osd prepare network:/var/lib/ceph/osd/ceph-osd1
    Preparing cluster ceph disks network:/var/lib/ceph/osd/ceph-osd1:
    connected to host: network
    detect platform information from remote host
    detect machine type
    Distro info: Ubuntu 14.04 trusty
    Deploying osd to network
    write cluster configuration to /etc/ceph/{cluster}.conf
    osd keyring does not exist yet, creating one
    create a keyring file
    Running command: udevadm trigger --subsystem-match=block --action=add
    Preparing host network disk /var/lib/ceph/osd/ceph-osd1 journal None activate False
    Running command: ceph-disk -v prepare --fs-type xfs --cluster ceph -- /var/lib/ceph/osd/ceph-osd1
    INFO:ceph-disk:Running command: /usr/bin/ceph-osd --cluster=ceph --show-config-value=fsid
    INFO:ceph-disk:Running command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup osd_mkfs_options_xfs
    INFO:ceph-disk:Running command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup osd_fs_mkfs_options_xfs
    INFO:ceph-disk:Running command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup osd_mount_options_xfs
    INFO:ceph-disk:Running command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup osd_fs_mount_options_xfs
    INFO:ceph-disk:Running command: /usr/bin/ceph-osd --cluster=ceph --show-config-value=osd_journal_size
    DEBUG:ceph-disk:Preparing osd data dir /var/lib/ceph/osd/ceph-osd1
    checking OSD status...
    Running command: ceph --cluster=ceph osd stat --format=json
    Host network is now ready for osd use.
   root@compute:/home/mengfei/my-cluster#

   root@compute:/home/mengfei/my-cluster# ceph-deploy osd activate network:/var/lib/ceph/osd/ceph-osd1
    found configuration file at: /root/.cephdeploy.conf
    Invoked (1.5.20): /usr/bin/ceph-deploy osd activate network:/var/lib/ceph/osd/ceph-osd1
    Activating cluster ceph disks network:/var/lib/ceph/osd/ceph-osd1:
    connected to host: network
    detect platform information from remote host
    detect machine type
    Distro info: Ubuntu 14.04 trusty
    activating host network disk /var/lib/ceph/osd/ceph-osd1
    will use init type: upstart
    Running command: ceph-disk -v activate --mark-init upstart --mount /var/lib/ceph/osd/ceph-osd1
    DEBUG:ceph-disk:Cluster uuid is 8b2af1e6-92eb-4d74-9ca5-057522bb738f
    INFO:ceph-disk:Running command: /usr/bin/ceph-osd --cluster=ceph --show-config-value=fsid
    DEBUG:ceph-disk:Cluster name is ceph
    DEBUG:ceph-disk:OSD uuid is c8b2811c-fb19-49c3-b630-374a4db7073e
    DEBUG:ceph-disk:Allocating OSD id...
    INFO:ceph-disk:Running command: /usr/bin/ceph --cluster ceph --name client.bootstrap-osd --keyring /var/lib/ceph/bootstrap-osd/ceph.keyring osd create --concise c8b2811c-fb19-49c3-b630-374a4db7073e
    DEBUG:ceph-disk:OSD id is 1
    DEBUG:ceph-disk:Initializing OSD...
    INFO:ceph-disk:Running command: /usr/bin/ceph --cluster ceph --name client.bootstrap-osd --keyring /var/lib/ceph/bootstrap-osd/ceph.keyring mon getmap -o /var/lib/ceph/osd/ceph-osd1/activate.monmap
    got monmap epoch 1
    INFO:ceph-disk:Running command: /usr/bin/ceph-osd --cluster ceph --mkfs --mkkey -i 1 --monmap /var/lib/ceph/osd/ceph-osd1/activate.monmap --osd-data /var/lib/ceph/osd/ceph-osd1 --osd-journal /var/lib/ceph/osd/ceph-osd1/journal --osd-uuid c8b2811c-fb19-49c3-b630-374a4db7073e --keyring /var/lib/ceph/osd/ceph-osd1/keyring
    2014-11-27 16:27:23.448198 b67e2740 -1 journal FileJournal::_open: disabling aio for non-block journal.Use journal_force_aio to force use of aio anyway
    2014-11-27 16:27:23.824770 b67e2740 -1 journal FileJournal::_open: disabling aio for non-block journal.Use journal_force_aio to force use of aio anyway
    2014-11-27 16:27:23.865648 b67e2740 -1 filestore(/var/lib/ceph/osd/ceph-osd1) could not find 23c2fcde/osd_superblock/0//-1 in index: (2) No such file or directory
    2014-11-27 16:27:23.885991 b67e2740 -1 created object store /var/lib/ceph/osd/ceph-osd1 journal /var/lib/ceph/osd/ceph-osd1/journal for osd.1 fsid 8b2af1e6-92eb-4d74-9ca5-057522bb738f
    2014-11-27 16:27:23.887571 b67e2740 -1 auth: error reading file: /var/lib/ceph/osd/ceph-osd1/keyring: can't open /var/lib/ceph/osd/ceph-osd1/keyring: (2) No such file or directory
    2014-11-27 16:27:23.890124 b67e2740 -1 created new key in keyring /var/lib/ceph/osd/ceph-osd1/keyring
    DEBUG:ceph-disk:Marking with init system upstart
    DEBUG:ceph-disk:Authorizing OSD key...
    INFO:ceph-disk:Running command: /usr/bin/ceph --cluster ceph --name client.bootstrap-osd --keyring /var/lib/ceph/bootstrap-osd/ceph.keyring auth add osd.1 -i /var/lib/ceph/osd/ceph-osd1/keyring osd allow * mon allow profile osd
    added key for osd.1
    DEBUG:ceph-disk:ceph osd.1 data dir is ready at /var/lib/ceph/osd/ceph-osd1
    DEBUG:ceph-disk:Creating symlink /var/lib/ceph/osd/ceph-1 -> /var/lib/ceph/osd/ceph-osd1
    DEBUG:ceph-disk:Starting ceph osd.1...
    INFO:ceph-disk:Running command: /sbin/initctl emit --no-wait -- ceph-osd cluster=ceph id=1
    checking OSD status...
    Running command: ceph --cluster=ceph osd stat --format=json
   root@compute:/home/mengfei/my-cluster#


(8) Copy the configuration file and admin key to the admin node and your Ceph nodes

   Note: use ceph-deploy to copy the configuration file and admin key to the admin node and your Ceph nodes.
       The next time you use the ceph command-line interface you will not need to specify the cluster monitor address, nor pass ceph.client.admin.keyring on every command.

ceph-deploy admin compute controller network   (note: sometimes --overwrite-conf is required; it was needed in this example)

   root@compute:/home/mengfei/my-cluster# ceph-deploy admin compute controller network
    found configuration file at: /root/.cephdeploy.conf
    Invoked (1.5.20): /usr/bin/ceph-deploy --overwrite-conf admin compute controller network
    Pushing admin keys and conf to compute
    connected to host: compute
    detect platform information from remote host
    detect machine type
    get remote short hostname
    write cluster configuration to /etc/ceph/{cluster}.conf
    Pushing admin keys and conf to controller
    connected to host: controller
    detect platform information from remote host
    detect machine type
    get remote short hostname
    write cluster configuration to /etc/ceph/{cluster}.conf
    Pushing admin keys and conf to network
    connected to host: network
    detect platform information from remote host
    detect machine type
    get remote short hostname
    write cluster configuration to /etc/ceph/{cluster}.conf
   root@compute:/home/mengfei/my-cluster#
   root@compute:/home/mengfei/my-cluster#
   root@compute:/home/mengfei/my-cluster#
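
   With ceph.conf and the admin keyring now in /etc/ceph on every node, the ceph CLI no longer needs the monitor address or keyring spelled out. For comparison, an illustrative invocation before this step versus after (addresses and paths follow this example):

   # before: point the client at a monitor and keyring by hand
   ceph -m 192.168.128.101:6789 --keyring /home/mengfei/my-cluster/ceph.client.admin.keyring health

   # after ceph-deploy admin: this is enough on any of the three nodes
   ceph health
   ceph -s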


(9) Verify the OSDs
ceph osd tree   show OSD status
ceph osd dump   show OSD configuration details
ceph osd rm   remove OSD(s): remove osd(s) <id> [<id>...]
ceph osd crush rm osd.0   remove an OSD disk from the cluster's CRUSH map
ceph osd crush rm node1   remove an OSD host node from the cluster's CRUSH map
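
   For reference, these removal commands are normally run in sequence; a sketch for completely removing osd.0 (a hypothetical target here, and the ceph osd out / ceph auth del steps are standard companions not shown elsewhere in this post):

   ceph osd out 0               # stop placing new data on the OSD
   ceph osd crush rm osd.0      # remove it from the CRUSH map
   ceph auth del osd.0          # delete its authentication key
   ceph osd rm 0                # remove it from the OSD map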

   root@compute:/home/mengfei/my-cluster# ceph osd tree   (weight defaults to 0)
   # id    weight  type name       up/down reweight
   -1      0       root default
   -2      0               host controller
   0       0                     osd.0   up      1
   -3      0               host network
   1       0                     osd.1   up      1
   root@compute:/home/mengfei/my-cluster#

   root@compute:/var/log/ceph# ceph osd dump
   epoch 89
   fsid 8b2af1e6-92eb-4d74-9ca5-057522bb738f
   created 2014-11-27 16:22:54.085639
   modified 2014-11-28 23:39:44.056533
   flags
   pool 0 'data' replicated size 3 min_size 1 crush_ruleset 0 object_hash rjenkins pg_num 64 pgp_num 64 last_change 89 flags hashpspool crash_replay_interval 45 stripe_width 0
   pool 1 'metadata' replicated size 3 min_size 1 crush_ruleset 0 object_hash rjenkins pg_num 64 pgp_num 64 last_change 88 flags hashpspool stripe_width 0
   pool 2 'rbd' replicated size 3 min_size 1 crush_ruleset 0 object_hash rjenkins pg_num 64 pgp_num 64 last_change 87 flags hashpspool stripe_width 0
   max_osd 2
   osd.0 up   in  weight 0 up_from 32 up_thru 82 down_at 31 last_clean_interval [15,29) 192.168.128.100:6800/2811 192.168.128.100:6801/2811 192.168.128.100:6802/2811 192.168.128.100:6803/2811 exists,up f4707c04-aeca-46fe-bf0e-f7e2d43d0524
   osd.1 up   in  weight 0 up_from 33 up_thru 82 down_at 29 last_clean_interval [14,28) 192.168.128.102:6800/3105 192.168.128.102:6801/3105 192.168.128.102:6802/3105 192.168.128.102:6803/3105 exists,up c8b2811c-fb19-49c3-b630-374a4db7073e
   root@compute:/var/log/ceph#







(IV) Expand the cluster

compute (mon0): add an OSD daemon, osd2, and a metadata server, mds0
controller (osd0): add a monitor, mon1
network (osd1): add a monitor, mon2

   Note: multiple monitors can form a quorum.

1. Add an OSD node on compute

(1) Create the osd2 directory on the compute node
ssh compute
mkdir -p /var/lib/ceph/osd/ceph-osd2

fdisk /dev/sdc
mkfs.xfs -f /dev/sdc1
mount /dev/sdc1 /var/lib/ceph/osd/ceph-osd2
mount -o remount,user_xattr /dev/sdc1 /var/lib/ceph/osd/ceph-osd2

vi /etc/fstab
/dev/sdc1 /var/lib/ceph/osd/ceph-osd2 xfs defaults 0 0
/dev/sdc1 /var/lib/ceph/osd/ceph-osd2 xfs remount,user_xattr 0 0

(2) On the admin node compute, prepare and activate the OSD
cd /home/mengfei/my-cluster
ceph-deploy osd prepare compute:/var/lib/ceph/osd/ceph-osd2
ceph-deploy osd activate compute:/var/lib/ceph/osd/ceph-osd2

   root@compute:/home/mengfei/my-cluster# ceph-deploy osd prepare compute:/var/lib/ceph/osd/ceph-osd2
    found configuration file at: /root/.cephdeploy.conf
    Invoked (1.5.20): /usr/bin/ceph-deploy osd prepare compute:/var/lib/ceph/osd/ceph-osd2
    Preparing cluster ceph disks compute:/var/lib/ceph/osd/ceph-osd2:
    connected to host: compute
    detect platform information from remote host
    detect machine type
    Distro info: Ubuntu 14.04 trusty
    Deploying osd to compute
    write cluster configuration to /etc/ceph/{cluster}.conf
    Running command: udevadm trigger --subsystem-match=block --action=add
    Preparing host compute disk /var/lib/ceph/osd/ceph-osd2 journal None activate False
    Running command: ceph-disk -v prepare --fs-type xfs --cluster ceph -- /var/lib/ceph/osd/ceph-osd2
    INFO:ceph-disk:Running command: /usr/bin/ceph-osd --cluster=ceph --show-config-value=fsid
    INFO:ceph-disk:Running command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup osd_mkfs_options_xfs
    INFO:ceph-disk:Running command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup osd_fs_mkfs_options_xfs
    INFO:ceph-disk:Running command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup osd_mount_options_xfs
    INFO:ceph-disk:Running command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup osd_fs_mount_options_xfs
    INFO:ceph-disk:Running command: /usr/bin/ceph-osd --cluster=ceph --show-config-value=osd_journal_size
    DEBUG:ceph-disk:Preparing osd data dir /var/lib/ceph/osd/ceph-osd2
    checking OSD status...
    Running command: ceph --cluster=ceph osd stat --format=json
    Host compute is now ready for osd use.
   root@compute:/home/mengfei/my-cluster#
   
   root@compute:/home/mengfei/my-cluster# ceph-deploy osd activate compute:/var/lib/ceph/osd/ceph-osd2
    found configuration file at: /root/.cephdeploy.conf
    Invoked (1.5.20): /usr/bin/ceph-deploy osd activate compute:/var/lib/ceph/osd/ceph-osd2
    Activating cluster ceph disks compute:/var/lib/ceph/osd/ceph-osd2:
    connected to host: compute
    detect platform information from remote host
    detect machine type
    Distro info: Ubuntu 14.04 trusty
    activating host compute disk /var/lib/ceph/osd/ceph-osd2
    will use init type: upstart
    Running command: ceph-disk -v activate --mark-init upstart --mount /var/lib/ceph/osd/ceph-osd2
    DEBUG:ceph-disk:Cluster uuid is 8b2af1e6-92eb-4d74-9ca5-057522bb738f
    INFO:ceph-disk:Running command: /usr/bin/ceph-osd --cluster=ceph --show-config-value=fsid
    DEBUG:ceph-disk:Cluster name is ceph
    DEBUG:ceph-disk:OSD uuid is 032998d3-03b5-458d-b32b-de48305e5b59
    DEBUG:ceph-disk:Allocating OSD id...
    INFO:ceph-disk:Running command: /usr/bin/ceph --cluster ceph --name client.bootstrap-osd --keyring /var/lib/ceph/bootstrap-osd/ceph.keyring osd create --concise 032998d3-03b5-458d-b32b-de48305e5b59
    DEBUG:ceph-disk:OSD id is 2
    DEBUG:ceph-disk:Initializing OSD...
    INFO:ceph-disk:Running command: /usr/bin/ceph --cluster ceph --name client.bootstrap-osd --keyring /var/lib/ceph/bootstrap-osd/ceph.keyring mon getmap -o /var/lib/ceph/osd/ceph-osd2/activate.monmap
    got monmap epoch 1
    INFO:ceph-disk:Running command: /usr/bin/ceph-osd --cluster ceph --mkfs --mkkey -i 2 --monmap /var/lib/ceph/osd/ceph-osd2/activate.monmap --osd-data /var/lib/ceph/osd/ceph-osd2 --osd-journal /var/lib/ceph/osd/ceph-osd2/journal --osd-uuid 032998d3-03b5-458d-b32b-de48305e5b59 --keyring /var/lib/ceph/osd/ceph-osd2/keyring
    2014-11-28 14:32:34.800238 b6822740 -1 journal FileJournal::_open: disabling aio for non-block journal.Use journal_force_aio to force use of aio anyway
    2014-11-28 14:32:35.280160 b6822740 -1 journal FileJournal::_open: disabling aio for non-block journal.Use journal_force_aio to force use of aio anyway
    2014-11-28 14:32:35.304026 b6822740 -1 filestore(/var/lib/ceph/osd/ceph-osd2) could not find 23c2fcde/osd_superblock/0//-1 in index: (2) No such file or directory
    2014-11-28 14:32:35.370476 b6822740 -1 created object store /var/lib/ceph/osd/ceph-osd2 journal /var/lib/ceph/osd/ceph-osd2/journal for osd.2 fsid 8b2af1e6-92eb-4d74-9ca5-057522bb738f
    2014-11-28 14:32:35.370543 b6822740 -1 auth: error reading file: /var/lib/ceph/osd/ceph-osd2/keyring: can't open /var/lib/ceph/osd/ceph-osd2/keyring: (2) No such file or directory
    2014-11-28 14:32:35.370712 b6822740 -1 created new key in keyring /var/lib/ceph/osd/ceph-osd2/keyring
    DEBUG:ceph-disk:Marking with init system upstart
    DEBUG:ceph-disk:Authorizing OSD key...
    INFO:ceph-disk:Running command: /usr/bin/ceph --cluster ceph --name client.bootstrap-osd --keyring /var/lib/ceph/bootstrap-osd/ceph.keyring auth add osd.2 -i /var/lib/ceph/osd/ceph-osd2/keyring osd allow * mon allow profile osd
    added key for osd.2
    DEBUG:ceph-disk:ceph osd.2 data dir is ready at /var/lib/ceph/osd/ceph-osd2
    DEBUG:ceph-disk:Creating symlink /var/lib/ceph/osd/ceph-2 -> /var/lib/ceph/osd/ceph-osd2
    DEBUG:ceph-disk:Starting ceph osd.2...
    INFO:ceph-disk:Running command: /sbin/initctl emit --no-wait -- ceph-osd cluster=ceph id=2
    checking OSD status...
    Running command: ceph --cluster=ceph osd stat --format=json
   root@compute:/home/mengfei/my-cluster#


(3) After adding the OSD node, watch the cluster rebalance
ceph osd tree
ceph -w
ceph -s
ceph osd dump

        root@compute:/home/mengfei/my-cluster# ceph osd tree   (weight defaults to 0)
        # id    weight  type name       up/down reweight
       -1      0       root default
       -2      0               host controller
       0       0                     osd.0   up      1
       -3      0               host network
       1       0                     osd.1   up      1
       -4      0               host compute
       2       0                     osd.2   up      1
       root@compute:/home/mengfei/my-cluster#

      root@compute:/home/mengfei/my-cluster# ceph -w      (the weight values have not been changed yet, so the status below shows 192 creating+incomplete)
          cluster 8b2af1e6-92eb-4d74-9ca5-057522bb738f
         health HEALTH_WARN 192 pgs incomplete; 192 pgs stuck inactive; 192 pgs stuck unclean; 50 requests are blocked > 32 sec
         monmap e3: 3 mons at {compute=192.168.128.101:6789/0,controller=192.168.128.100:6789/0,network=192.168.128.102:6789/0}, election epoch 6, quorum 0,1,2 controller,compute,network
         mdsmap e5: 1/1/1 up {0=compute=up:creating}
         osdmap e23: 3 osds: 3 up, 3 in
            pgmap v50: 192 pgs, 3 pools, 0 bytes data, 0 objects
                  398 MB used, 2656 MB / 3054 MB avail
                     192 creating+incomplete
      root@compute:/home/mengfei/my-cluster#
      
      root@compute:/home/mengfei/my-cluster# ceph osd dump
      epoch 23
      fsid 8b2af1e6-92eb-4d74-9ca5-057522bb738f
      created 2014-11-27 16:22:54.085639
      modified 2014-11-28 16:30:06.501906
      flags
      pool 0 'data' replicated size 3 min_size 2 crush_ruleset 0 object_hash rjenkins pg_num 64 pgp_num 64 last_change 1 flags hashpspool crash_replay_interval 45 stripe_width 0
      pool 1 'metadata' replicated size 3 min_size 2 crush_ruleset 0 object_hash rjenkins pg_num 64 pgp_num 64 last_change 1 flags hashpspool stripe_width 0
      pool 2 'rbd' replicated size 3 min_size 2 crush_ruleset 0 object_hash rjenkins pg_num 64 pgp_num 64 last_change 1 flags hashpspool stripe_width 0
      max_osd 3
      osd.0 up   in  weight 0 up_from 15 up_thru 15 down_at 12 last_clean_interval [4,11) 192.168.128.100:6800/3272 192.168.128.100:6801/3272 192.168.128.100:6802/3272 192.168.128.100:6803/3272 exists,up f4707c04-aeca-46fe-bf0e-f7e2d43d0524
      osd.1 up   in  weight 0 up_from 14 up_thru 0 down_at 13 last_clean_interval [8,12) 192.168.128.102:6800/3272 192.168.128.102:6801/3272 192.168.128.102:6802/3272 192.168.128.102:6803/3272 exists,up c8b2811c-fb19-49c3-b630-374a4db7073e
      osd.2 up   in  weight 0 up_from 22 up_thru 0 down_at 21 last_clean_interval [19,19) 192.168.128.101:6801/16367 192.168.128.101:6802/16367 192.168.128.101:6803/16367 192.168.128.101:6804/16367 exists,up 032998d3-03b5-458d-b32b-de48305e5b59
      root@compute:/home/mengfei/my-cluster#


2. Add a metadata server on compute
   Note: to use the CephFS filesystem, at least one metadata server is required.
   Note: Ceph currently supports only one metadata server in production; you can try running several, but that is not commercially supported.
ceph-deploy mds create compute
ceph mds stat   show MDS status
ceph mds dump   show MDS map details

   root@compute:/home/mengfei/my-cluster# ceph-deploy mds create compute
    found configuration file at: /root/.cephdeploy.conf
    Invoked (1.5.20): /usr/bin/ceph-deploy mds create compute
    Deploying mds, cluster ceph hosts compute:compute
    connected to host: compute
    detect platform information from remote host
    detect machine type
    Distro info: Ubuntu 14.04 trusty
    remote host will use upstart
    deploying mds bootstrap to compute
    write cluster configuration to /etc/ceph/{cluster}.conf
    create path if it doesn't exist
    Running command: ceph --cluster ceph --name client.bootstrap-mds --keyring /var/lib/ceph/bootstrap-mds/ceph.keyring auth get-or-create mds.compute osd allow rwx mds allow mon allow profile mds -o /var/lib/ceph/mds/ceph-compute/keyring
    Running command: initctl emit ceph-mds cluster=ceph id=compute
   Unhandled exception in thread started by
   root@compute:/home/mengfei/my-cluster#

   root@compute:/home/mengfei/my-cluster# ceph mds stat
   e3: 1/1/1 up {0=compute=up:creating}
   root@compute:/home/mengfei/my-cluster#
   
   root@compute:/home/mengfei/my-cluster# ceph mds dump
   dumped mdsmap epoch 3
   epoch   3
   flags   0
   created 2014-11-27 16:22:54.081490
   modified      2014-11-28 14:45:35.509558
   tableserver   0
   root    0
   session_timeout 60
   session_autoclose       300
   max_file_size   1099511627776
   last_failure    0
   last_failure_osd_epoch  0
   compat  compat={},rocompat={},incompat={1=base v0.20,2=client writeable ranges,3=default file layouts on dirs,4=dir inode in separate object,5=mds uses versioned encoding,6=dirfrag is stored in omap}
   max_mds 1
   in      0
   up      {0=4306}
   failed
   stopped
   data_pools      0
   metadata_pool   1
   inline_data   disabled
   4306:   192.168.128.101:6805/7363 'compute' mds.0.1 up:creating seq 1
   root@compute:/home/mengfei/my-cluster#


Removing the metadata server (note: when stopping the mds, the message below appears; max_mds must be decreased first):
root@compute:/home/mengfei# ceph mds stop 0
Error EBUSY: must decrease max_mds or else MDS will immediately reactivate
root@compute:/home/mengfei#

root@compute:/home/mengfei# ceph mds set_max_mds 0   (decrease the max value)
max_mds = 0
root@compute:/home/mengfei#

root@compute:/home/mengfei# ceph mds stop 0
telling mds.0 192.168.128.101:6800/26057 to deactivate
root@compute:/home/mengfei#





3. Add monitors mon1 and mon2 on controller (osd0) and network (osd1)
      Note: Ceph uses the Paxos algorithm and needs several monitors to form a quorum (e.g. 1; 2 of 3; 3 of 4; 3 of 5; 4 of 6; and so on).

ceph-deploy admin controller network   (note: redistributes the configuration file shown below)
ceph-deploy mon create controller network

      Note: running the commands above first complained that /var/run/ceph/ceph-mon.controller.asok was not found; the root cause was an incorrect ceph.conf.
      After adding the relevant entries and pushing the file to all nodes, everything worked.
vi /home/mengfei/my-cluster/ceph.conf    (the entries below are not complete either and will be revised later; the section assignments follow the deployment above: osd.0 on controller, osd.1 on network, osd.2 on compute, a monitor on each node, and the mds on compute)
   
   [global]
   fsid = 8b2af1e6-92eb-4d74-9ca5-057522bb738f
   mon_initial_members = compute,controller,network
   mon_host = 192.168.128.101,192.168.128.100,192.168.128.102
   public network = 192.168.128.0/24
   auth_cluster_required = cephx
   auth_service_required = cephx
   auth_client_required = cephx
   #filestore_xattr_use_omap = true

   osd journal size = 100
   filestore_xattr_use_omap = true
   osd pool default size = 3
   osd pool default min_size = 1
   osd crush chooseleaf type = 1

   [osd.0]
   host = controller

   [osd.1]
   host = network

   [osd.2]
   host = compute

   [mon.compute]
   host = compute
   mon_addr = 192.168.128.101:6789

   [mon.controller]
   host = controller
   mon_addr = 192.168.128.100:6789

   [mon.network]
   host = network
   mon_addr = 192.168.128.102:6789

   [mds.compute]
   host = compute
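
   To push the edited file out to all nodes (the "push to all nodes" mentioned above), ceph-deploy's config push subcommand can be used from the my-cluster directory; --overwrite-conf is needed when the remote copies already differ:

   cd /home/mengfei/my-cluster
   ceph-deploy --overwrite-conf config push compute controller network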


   root@compute:/home/mengfei/my-cluster# ceph-deploy mon create controller network
    found configuration file at: /root/.cephdeploy.conf
    Invoked (1.5.20): /usr/bin/ceph-deploy mon create controller network
    Deploying mon, cluster ceph hosts controller network
    detecting platform for host controller ...
    connected to host: controller
    detect platform information from remote host
    detect machine type
    distro info: Ubuntu 14.04 trusty
    determining if provided host has same hostname in remote
    get remote short hostname
    deploying mon to controller
    get remote short hostname
    remote hostname: controller
    write cluster configuration to /etc/ceph/{cluster}.conf
    create the mon path if it does not exist
    checking for done path: /var/lib/ceph/mon/ceph-controller/done
    create a done file to avoid re-doing the mon deployment
    create the init path if it does not exist
    locating the `service` executable...
    Running command: initctl emit ceph-mon cluster=ceph id=controller
    Running command: ceph --cluster=ceph --admin-daemon /var/run/ceph/ceph-mon.controller.asok mon_status
    ********************************************************************************
    status for monitor: mon.controller
    {
      "election_epoch": 0,
      "extra_probe_peers": [
      "192.168.128.101:6789/0"
      ],
      "monmap": {
      "created": "0.000000",
      "epoch": 1,
      "fsid": "8b2af1e6-92eb-4d74-9ca5-057522bb738f",
      "modified": "0.000000",
      "mons": [
          {
            "addr": "192.168.128.101:6789/0",
            "name": "compute",
            "rank": 0
          }
      ]
      },
      "name": "controller",
      "outside_quorum": [],
      "quorum": [],
      "rank": -1,
      "state": "probing",
      "sync_provider": []
    }
    ********************************************************************************
    monitor: mon.controller is currently at the state of probing
    Running command: ceph --cluster=ceph --admin-daemon /var/run/ceph/ceph-mon.controller.asok mon_status
    monitor controller does not exist in monmap
    detecting platform for host network ...
    connected to host: network
    detect platform information from remote host
    detect machine type
    distro info: Ubuntu 14.04 trusty
    determining if provided host has same hostname in remote
    get remote short hostname
    deploying mon to network
    get remote short hostname
    remote hostname: network
    write cluster configuration to /etc/ceph/{cluster}.conf
    create the mon path if it does not exist
    checking for done path: /var/lib/ceph/mon/ceph-network/done
    create a done file to avoid re-doing the mon deployment
    create the init path if it does not exist
    locating the `service` executable...
    Running command: initctl emit ceph-mon cluster=ceph id=network
    Running command: ceph --cluster=ceph --admin-daemon /var/run/ceph/ceph-mon.network.asok mon_status
    ********************************************************************************
    status for monitor: mon.network
    {
      "election_epoch": 1,
      "extra_probe_peers": [
      "192.168.128.101:6789/0"
      ],
      "monmap": {
      "created": "0.000000",
      "epoch": 3,
      "fsid": "8b2af1e6-92eb-4d74-9ca5-057522bb738f",
      "modified": "2014-11-28 16:18:49.267793",
      "mons": [
          {
            "addr": "192.168.128.100:6789/0",
            "name": "controller",
            "rank": 0
          },
          {
            "addr": "192.168.128.101:6789/0",
            "name": "compute",
            "rank": 1
          },
          {
            "addr": "192.168.128.102:6789/0",
            "name": "network",
            "rank": 2
          }
      ]
      },
      "name": "network",
      "outside_quorum": [],
      "quorum": [],
      "rank": 2,
      "state": "electing",
      "sync_provider": []
    }
    ********************************************************************************
    monitor: mon.network is running
    Running command: ceph --cluster=ceph --admin-daemon /var/run/ceph/ceph-mon.network.asok mon_status
   root@compute:/home/mengfei/my-cluster#



Check the monitor quorum status    (note: after monitors are added, Ceph synchronizes them and they form a quorum)
ceph mon stat
ceph mon_status
ceph mon dump
ceph quorum_status

   root@compute:/home/mengfei/my-cluster# ceph mon stat
   e3: 3 mons at {compute=192.168.128.101:6789/0,controller=192.168.128.100:6789/0,network=192.168.128.102:6789/0}, election epoch 6, quorum 0,1,2 controller,compute,network
   root@compute:/home/mengfei/my-cluster#
   
   root@compute:/home/mengfei/my-cluster# ceph mon_status
   {"name":"controller","rank":0,"state":"leader","election_epoch":6,"quorum":,"outside_quorum":[],"extra_probe_peers":["192.168.128.101:6789\/0","192.168.128.102:6789\/0"],"sync_provider":[],"monmap":{"epoch":3,"fsid":"8b2af1e6-92eb-4d74-9ca5-057522bb738f","modified":"2014-11-28 16:18:49.267793","created":"0.000000","mons":[{"rank":0,"name":"controller","addr":"192.168.128.100:6789\/0"},{"rank":1,"name":"compute","addr":"192.168.128.101:6789\/0"},{"rank":2,"name":"network","addr":"192.168.128.102:6789\/0"}]}}
   root@compute:/home/mengfei/my-cluster#
   
   root@compute:/home/mengfei/my-cluster# ceph mon dump
   dumped monmap epoch 3
   epoch 3
   fsid 8b2af1e6-92eb-4d74-9ca5-057522bb738f
   last_changed 2014-11-28 16:18:49.267793
   created 0.000000
   0: 192.168.128.100:6789/0 mon.controller
   1: 192.168.128.101:6789/0 mon.compute
   2: 192.168.128.102:6789/0 mon.network
   root@compute:/home/mengfei/my-cluster#   
   
   root@compute:/home/mengfei/my-cluster# ceph quorum_status
      {"election_epoch":6,"quorum":,"quorum_names":["controller","compute","network"],"quorum_leader_name":"controller","monmap":{"epoch":3,"fsid":"8b2af1e6-92eb-4d74-9ca5-057522bb738f","modified":"2014-11-28 16:18:49.267793","created":"0.000000","mons":[{"rank":0,"name":"controller","addr":"192.168.128.100:6789\/0"},{"rank":1,"name":"compute","addr":"192.168.128.101:6789\/0"},{"rank":2,"name":"network","addr":"192.168.128.102:6789\/0"}]}}
      root@compute:/home/mengfei/my-cluster#

(4) Verify the cluster OSDs and check cluster health
ceph health               check health status
ceph auth list            list authentication keys and capabilities
ceph osd tree             show the OSD tree
ceph -s                   show cluster status
ceph -w                   watch cluster status in real time (same content as -s)
ceph osd dump             show the OSD map and configuration
ceph osd rm               remove osd(s) <id> [<id>...]  (a full removal sequence is sketched below)
ceph osd crush rm osd.0   remove an OSD disk from the CRUSH map
ceph osd crush rm node1   remove an OSD host node from the CRUSH map
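
A full OSD removal usually chains several of these commands. A hedged sketch for removing osd.0, assuming its daemon has already been stopped on the host:

   ceph osd out 0                 # mark the OSD out so data migrates away
   ceph osd crush rm osd.0        # remove it from the CRUSH map
   ceph auth del osd.0            # delete its authentication key
   ceph osd rm 0                  # finally remove it from the cluster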

The following commands change the object replica count and the minimum replica count per pool (these can also be set in ceph.conf; see the sketch after these commands):
ceph osd pool set data size 3
ceph osd pool set metadata size 3
ceph osd pool set rbd size 3

ceph osd pool set data min_size 1
ceph osd pool set metadata min_size 1
ceph osd pool set rbd min_size 1
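
A minimal ceph.conf sketch of the equivalent cluster-wide defaults (an assumption-level example, not taken from the original setup; these options only affect pools created after the change):

   [global]
   osd pool default size = 3        # replicas kept per object
   osd pool default min size = 1    # minimum replicas required to accept I/O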

The following setting raises the allowed clock drift between monitors (by default, this example logs "clock skew detected on mon.compute" in ceph -w; ceph health detail shows the specifics. Raising the value to 0.5 brings the health back to OK):

mon_clock_drift_allowed = 0.5
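
A sketch of where the option lives and one way to roll it out (hedged; assumes the ceph.conf under my-cluster is the master copy and the upstart-managed monitors used in this guide):

   [mon]
   mon_clock_drift_allowed = 0.5

   # push the updated conf to every node, then restart the monitors
   ceph-deploy --overwrite-conf config push controller compute network
   restart ceph-mon-all        # run on each monitor node (Ubuntu upstart)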

The following commands set the CRUSH weight values:
ceph osd crush set 0 1.0 host=controller
ceph osd crush set 1 1.0 host=network
ceph osd crush set 2 1.0 host=compute

    Note: you will see the PG states go from active+clean to active with some degraded objects;
          once the migration finishes they return to active+clean. (Press Ctrl+C to exit the watch.)

   root@compute:/var/log/ceph# ceph osd crush set 0 1.0 host=controller
   set item id 0 name 'osd.0' weight 1 at location {host=controller} to crush map
   root@compute:/var/log/ceph# ceph osd crush set 1 1.0 host=network
   set item id 1 name 'osd.1' weight 1 at location {host=network} to crush map
   root@compute:/var/log/ceph# ceph osd crush set 2 1.0 host=compute
   set item id 2 name 'osd.2' weight 1 at location {host=compute} to crush map
   root@compute:/var/log/ceph#

   root@compute:/home/mengfei/my-cluster# ceph osd tree   (the weight defaults to 0 here)
   # id    weight  type name       up/down reweight
   -1      0       root default
   -2      0               host controller
   0       0                     osd.0   up      1
   -3      0               host network
   1       0                     osd.1   up      1
   -4      0               host compute
   2       0                     osd.2   up      1
   root@compute:/home/mengfei/my-cluster#
   
   root@compute:/home/mengfei/my-cluster# ceph -s   (pgs stuck inactive+unclean; in this example, after changing the weights from the default 0 to 1, the cluster recovered)
         cluster 8b2af1e6-92eb-4d74-9ca5-057522bb738f
          health HEALTH_WARN 192 pgs incomplete; 192 pgs stuck inactive; 192 pgs stuck unclean
          monmap e1: 1 mons at {compute=192.168.128.101:6789/0}, election epoch 1, quorum 0 compute
          osdmap e16: 2 osds: 2 up, 2 in
         pgmap v31: 192 pgs, 3 pools, 0 bytes data, 0 objects
               266 MB used, 1770 MB / 2036 MB avail
                      192 creating+incomplete
   root@compute:/home/mengfei/my-cluster#
   
   root@compute:/home/mengfei/my-cluster# ceph osd tree   (after setting the weights to 1)
   # id    weight  type name       up/down reweight
   -1      3       root default
   -2      1               host controller
   0       1                     osd.0   up      1
   -3      1               host network
   1       1                     osd.1   up      1
   -4      1               host compute
   2       1                     osd.2   up      1
   root@compute:/home/mengfei/my-cluster#
   
   root@compute:/home/mengfei/my-cluster# ceph -s   (output after setting weight=1)
   cluster 8b2af1e6-92eb-4d74-9ca5-057522bb738f
   health HEALTH_WARN clock skew detected on mon.compute, mon.network   (this is a clock-skew warning and is harmless; setting mon_clock_drift_allowed = 0.5 in ceph.conf resolves it)
   monmap e3: 3 mons at {compute=192.168.128.101:6789/0,controller=192.168.128.100:6789/0,network=192.168.128.102:6789/0}, election epoch 30, quorum 0,1,2 controller,compute,network
   mdsmap e14: 1/1/1 up {0=compute=up:active}
   osdmap e89: 3 osds: 3 up, 3 in
      pgmap v351: 192 pgs, 3 pools, 1884 bytes data, 20 objects
            406 MB used, 2648 MB / 3054 MB avail
               192 active+clean
   root@compute:/home/mengfei/my-cluster#

   root@compute:/var/log/ceph# ceph osd dump
   epoch 89
   fsid 8b2af1e6-92eb-4d74-9ca5-057522bb738f
   created 2014-11-27 16:22:54.085639
   modified 2014-11-28 23:39:44.056533
   flags
   pool 0 'data' replicated size 3 min_size 1 crush_ruleset 0 object_hash rjenkins pg_num 64 pgp_num 64 last_change 89 flags hashpspool crash_replay_interval 45 stripe_width 0
   pool 1 'metadata' replicated size 3 min_size 1 crush_ruleset 0 object_hash rjenkins pg_num 64 pgp_num 64 last_change 88 flags hashpspool stripe_width 0
   pool 2 'rbd' replicated size 3 min_size 1 crush_ruleset 0 object_hash rjenkins pg_num 64 pgp_num 64 last_change 87 flags hashpspool stripe_width 0
   max_osd 3
   osd.0 up   in  weight 1 up_from 32 up_thru 82 down_at 31 last_clean_interval [15,29) 192.168.128.100:6800/2811 192.168.128.100:6801/2811 192.168.128.100:6802/2811 192.168.128.100:6803/2811 exists,up f4707c04-aeca-46fe-bf0e-f7e2d43d0524
   osd.1 up   in  weight 1 up_from 33 up_thru 82 down_at 29 last_clean_interval [14,28) 192.168.128.102:6800/3105 192.168.128.102:6801/3105 192.168.128.102:6802/3105 192.168.128.102:6803/3105 exists,up c8b2811c-fb19-49c3-b630-374a4db7073e
   osd.2 up   in  weight 1 up_from 35 up_thru 82 down_at 30 last_clean_interval [27,29) 192.168.128.101:6801/3173 192.168.128.101:6802/3173 192.168.128.101:6803/3173 192.168.128.101:6804/3173 exists,up 032998d3-03b5-458d-b32b-de48305e5b59
   root@compute:/var/log/ceph#


(5) Storing and retrieving object data

    Note: to operate on object data in the Ceph storage cluster, a Ceph client must:
    1. set an object name
    2. specify a pool

    The Ceph client retrieves the latest cluster map and uses the CRUSH algorithm to compute how the object maps
    to a placement group (PG), then how that PG maps dynamically to a Ceph OSD daemon. To find an object's
    location, all you need is the object name and the pool name:

   ceph osd map {poolname} {object-name}

1. Exercise: locate an object
As an exercise, first create an object. Use the rados put command, specifying an object name, the path to a test file holding the object data, and the pool name. For example:

Format: rados put {object-name} {file-path} --pool=data
rados put zhi-ceph zhi.txt --pool=data

2. To verify that the Ceph storage cluster has stored the object, run:
rados -p data ls
rados -p metadata ls

   root@compute:/home/mengfei/my-cluster# rados -p data ls
   zhi-ceph
   root@compute:/home/mengfei/my-cluster#

   root@compute:/home/mengfei/my-cluster# rados -p metadata ls
   609.00000000
   mds0_sessionmap
   608.00000000
   601.00000000
   602.00000000
   mds0_inotable
   1.00000000.inode
   200.00000000
   604.00000000
   605.00000000
   mds_anchortable
   mds_snaptable
   600.00000000
   603.00000000
   100.00000000
   200.00000001
   606.00000000
   607.00000000
   100.00000000.inode
   1.00000000
   root@compute:/home/mengfei/my-cluster#

3. Now identify the object's location:
Format: ceph osd map {pool-name} {object-name}
ceph osd map data zhi-ceph
ceph osd map metadata zhi-ceph

   root@compute:/home/mengfei/my-cluster# ceph osd map data zhi-ceph
   osdmap e89 pool 'data' (0) object 'zhi-ceph' -> pg 0.e67b1a3 (0.23) -> up (, p1) acting (, p1)
   root@compute:/home/mengfei/my-cluster#
   
   root@compute:/home/mengfei/my-cluster# ceph osd map metadata zhi-ceph
   osdmap e89 pool 'metadata' (1) object 'zhi-ceph' -> pg 1.e67b1a3 (1.23) -> up (, p0) acting (, p0)
   root@compute:/home/mengfei/my-cluster#

Ceph prints the object's location, for example:
osdmap e537 pool 'data' (0) object 'test-object-1' -> pg 0.d1743484 (0.4) -> up acting
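
Since this section covers both storing and retrieving objects, a quick retrieval round-trip fits here (a hedged sketch; the output filename zhi-out.txt is arbitrary):

rados -p data get zhi-ceph zhi-out.txt
diff zhi.txt zhi-out.txt     # no output means the retrieved copy matches the original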

4. To delete the test object, use the rados rm command:

rados rm zhi-ceph --pool=data

Note: as the cluster expands, object locations may change dynamically. One benefit of Ceph's dynamic rebalancing is that Ceph performs the migration automatically, with no manual intervention required.

5. Create a pool
ceph osd pool create zhi-pool 128
ceph osd pool set zhi-pool min_size 1
ceph -w    watch the data migration in real time
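
A quick way to confirm the new pool exists and check its parameters (a sketch; the pool name follows the example above):

ceph osd lspools                      # list pools with their ids
ceph osd pool get zhi-pool pg_num     # confirm the placement-group count
rados df                              # per-pool object and space usage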


(6) Block device quick start
      Note: to use this guide you must first have completed the storage cluster quick start above. Before working with Ceph block devices, make sure your Ceph storage cluster is in an active+clean state. Run this quick start from the admin node.
Note: Ceph block devices are also known as RBD or RADOS block devices.

1. Install Ceph

(1) Check the Linux kernel version

lsb_release -a
uname -r

(2) On the admin node, use ceph-deploy to install Ceph on your ceph-client node (note: it was already installed earlier, so this step is not repeated here)
ceph-deploy install network    (in this example, network acts as the client)

(3) On the admin node, use ceph-deploy to copy the Ceph configuration file and ceph.client.admin.keyring to your ceph-client:
ceph-deploy admin network


2. Configure a block device

(1) On the ceph-client node, create a block device image:
rbd create foo --size 4096 [-m {mon-IP}] [-k /path/to/ceph.client.admin.keyring]

ceph osd pool create rbd-pool 128
ceph osd pool set rbd-pool min_size 1

rbd create foo --size 512
rbd create bar --size 256 --pool rbd-pool
rbd create zhi --size 512 --pool rbd-pool

ceph osd pool delete rbd-pool rbd-pool --yes-i-really-really-mean-it
   Note: to delete a pool you must specify the pool name twice and append the confirmation flag shown above.

Verify and query block device information:
rbd ls               list block device images (in the default rbd pool)
rbd ls rbd-pool      list block device images in a specific pool
rbd --image foo info query information about a specific image
rbd --image bar -p rbd-pool info   query information about an image inside a pool

   root@network:/home/mengfei/my-cluster# rbd ls
   foo
   root@network:/home/mengfei/my-cluster#
   
   root@network:/home/mengfei/my-cluster# rbd ls rbd-pool
   bar
   root@network:/home/mengfei/my-cluster#

   root@network:/home/mengfei# rbd showmapped
   id pool   image snap device   
   1  rbd      foo   -    /dev/rbd1
   2  rbd-pool zhi   -    /dev/rbd2
   3  rbd-pool bar   -    /dev/rbd3
   root@network:/home/mengfei#
   
   root@network:/home/mengfei/my-cluster# rbd --image foo info
   rbd image 'foo':
             size 512 MB in 128 objects
             order 22 (4096 kB objects)
             block_name_prefix: rb.0.16cb.2ae8944a
             format: 1
   root@network:/home/mengfei/my-cluster#

   root@network:/home/mengfei/my-cluster# rbd --image bar -p rbd-pool info
   rbd image 'bar':
             size 512 MB in 128 objects
             order 22 (4096 kB objects)
             block_name_prefix: rb.0.16b8.2ae8944a
             format: 1
   root@network:/home/mengfei/my-cluster#
   
Resizing a block device image
Ceph block device images are thin-provisioned: they do not actually consume physical storage until you start saving data.
They do, however, have a maximum capacity set with the --size option. To increase (or decrease) the maximum size of a Ceph block device image, run:
rbd resize --image foo --size 1024                    grow the image
rbd resize --image foo --allow-shrink --size 512      shrink the image

rbd resize --image zhi -p rbd-pool --size 1024
rbd resize --image zhi -p rbd-pool --allow-shrink --size 256   
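
To confirm the new size took effect, re-query the image (a quick check, not part of the original log; the size shown should match the last resize):

rbd --image zhi -p rbd-pool info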

Removing a block device image
rbd rm foo               remove a block device image
rbd rm bar -p rbd-pool   remove a block device image from a pool


(2) On the ceph-client node, load the rbd kernel module:
modprobe rbd

(3) On the ceph-client node, map the image to a block device:
rbd map foo --pool rbd --name client.admin [-m {mon-IP}] [-k /path/to/ceph.client.admin.keyring]

rbd map foo
rbd map bar --pool rbd-pool
rbd map zhi --pool rbd-pool    (the examples below use zhi)

rbd showmapped       show mapped block devices

Unmapping a block device
Format: rbd unmap /dev/rbd/{poolname}/{imagename}
rbd unmap /dev/rbd/rbd/foo

(4) Use the block device to create a filesystem on the ceph-client node (network):
mkfs.ext4 -m0 /dev/rbd/rbd-pool/zhi   (this may take a few minutes)

      root@network:/home/mengfei/my-cluster# rbd map zhi --pool rbd-pool
      root@network:/home/mengfei/my-cluster# mkfs.ext4 -m0 /dev/rbd/rbd-pool/zhi
      mke2fs 1.42.9 (4-Feb-2014)
      Filesystem label=
      OS type: Linux
      Block size=4096 (log=2)
      Fragment size=4096 (log=2)
      Stride=1024 blocks, Stripe width=1024 blocks
      32768 inodes, 131072 blocks
      0 blocks (0.00%) reserved for the super user
      First data block=0
      Maximum filesystem blocks=134217728
      4 block groups
      32768 blocks per group, 32768 fragments per group
      8192 inodes per group
      Superblock backups stored on blocks:
            32768, 98304
      
      Allocating group tables: done                           
      Writing inode tables: done                           
      Creating journal (4096 blocks): done
      Writing superblocks and filesystem accounting information: done
      
      root@network:/home/mengfei/my-cluster#


(5) Mount the filesystem on your ceph-client node:
mkdir /mnt/ceph-block-device
mount /dev/rbd/rbd-pool/zhi /mnt/ceph-block-device
cd /mnt/ceph-block-device
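
A quick sanity check of the mounted device (hedged; the test file name is arbitrary):

df -h /mnt/ceph-block-device                                         # confirm size and mount point
dd if=/dev/zero of=/mnt/ceph-block-device/test.img bs=1M count=10    # write a small test file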

Configure automatic mapping:
By default, once block devices have been created there is an /etc/init.d/rbdmap init script.
root@network:/home/mengfei# vi /etc/ceph/rbdmap   (for some reason this file exists on the admin node compute but not on the client node network, so for now it was copied over from compute)
# RbdDevice             Parameters
#poolname/imagename   id=client,keyring=/etc/ceph/ceph.client.keyring
rbd-pool/zhi
#rbd-pool/bar
#rbd/foo
   
   Note: since cephx authentication is disabled here, there is no need to configure a keyring.
   With this in place the rbd block devices can be managed manually and are mapped/unmapped automatically at boot and shutdown (see the fstab sketch below for automatic mounting).
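
   rbdmap only maps the devices; if the filesystem should also be mounted automatically at boot, an /etc/fstab entry along these lines is commonly paired with it (a sketch, not part of the original setup; _netdev delays the mount until networking is up):

   /dev/rbd/rbd-pool/zhi  /mnt/ceph-block-device  ext4  defaults,noatime,_netdev  0  0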
   

Configure rbdmap (in this example /etc/init.d/rbdmap is generated automatically)
After creating an rbd block device and running rbd map, if you do not rbd unmap it in time the system will hang at shutdown while trying to umount the rbd device.
Configuring rbdmap is therefore required. First download the script and enable it at boot:
$ sudo wget https://raw.github.com/ceph/ceph/a4ddf704868832e119d7949e96fe35ab1920f06a/src/init-rbdmap -O /etc/init.d/rbdmap
$ sudo chmod +x /etc/init.d/rbdmap
$ sudo update-rc.d rbdmap defaults

   root@network:/home/mengfei# update-rc.d rbdmap defaults
      Adding system startup for /etc/init.d/rbdmap ...
      /etc/rc0.d/K20rbdmap -> ../init.d/rbdmap
      /etc/rc1.d/K20rbdmap -> ../init.d/rbdmap
      /etc/rc6.d/K20rbdmap -> ../init.d/rbdmap
      /etc/rc2.d/S20rbdmap -> ../init.d/rbdmap
      /etc/rc3.d/S20rbdmap -> ../init.d/rbdmap
      /etc/rc4.d/S20rbdmap -> ../init.d/rbdmap
      /etc/rc5.d/S20rbdmap -> ../init.d/rbdmap
   root@network:/home/mengfei#




(6) Verify the rbd information
ceph -w
rados -p rbd-pool ls
ceph osd map rbd-pool rbd

   root@network:/home/mengfei/my-cluster# rados -p rbd-pool ls
   rb.0.16df.2ae8944a.000000000041
   rb.0.16df.2ae8944a.000000000042
   rbd_directory
   rb.0.16df.2ae8944a.000000000060
   rb.0.16df.2ae8944a.000000000001
   rb.0.16df.2ae8944a.000000000020
   bar.rbd
   zhi.rbd
   rb.0.16df.2ae8944a.000000000002
   rb.0.16df.2ae8944a.000000000040
   rb.0.16df.2ae8944a.000000000043
   rb.0.16df.2ae8944a.00000000007f
   rb.0.16df.2ae8944a.000000000000
   root@network:/home/mengfei/my-cluster#
   
   root@network:/home/mengfei/my-cluster# ceph osd map rbd-pool rbd
   osdmap e139 pool 'rbd-pool' (4) object 'rbd' -> pg 4.7a31dfd8 (4.58) -> up (, p1) acting (, p1)
   root@network:/home/mengfei/my-cluster#




(7) Ceph filesystem (Ceph FS) quick start

(1) Prerequisites
Install ceph-common:
apt-get install ceph-common

   Note: make sure the Ceph storage cluster is running and in an active+clean state, and that at least one Ceph metadata server (MDS) is running:
ceph -s

      root@compute:/var/lib/ceph/osd/ceph-osd2/current# ceph -s
          cluster 8b2af1e6-92eb-4d74-9ca5-057522bb738f
         health HEALTH_OK
         monmap e3: 3 mons at {compute=192.168.128.101:6789/0,controller=192.168.128.100:6789/0,network=192.168.128.102:6789/0}, election epoch 72, quorum 0,1,2 controller,compute,network
         mdsmap e34: 1/1/1 up {0=compute=up:active}
         osdmap e139: 3 osds: 3 up, 3 in
            pgmap v758: 448 pgs, 5 pools, 60758 kB data, 43 objects
                  584 MB used, 2470 MB / 3054 MB avail
                     448 active+clean
      root@compute:/var/lib/ceph/osd/ceph-osd2/current#

(2) Create a filesystem
ceph osd pool create cephfs_data 64        (note: ceph osd dump shows the newly created pool name and its integer id; this pool gets id 5)
ceph osd pool create cephfs_metadata 64    (this pool gets id 6)
#ceph fs newfs mycephfs cephfs_metadata cephfs_data    (this command is wrong here; use the command below instead)
ceph osd dump

Create a new MDS filesystem from the pools:
Format: ceph mds newfs <int> <int> {--yes-i-really-mean-it}   (make a new filesystem using pools <metadata> and <data>)

ceph mds dump
ceph mds newfs 6 5 --yes-i-really-mean-it

   root@compute:/home/mengfei/my-cluster# ceph mds newfs 6 5 --yes-i-really-mean-it
   new fs with metadata pool 6 and data pool 5
   root@compute:/home/mengfei/my-cluster#

ceph mds add_data_pool cephfs_data    (use this command to add a data pool; note that if the default pool 'data' has been added it cannot be removed later, so create a new fs with the command above instead)

      root@compute:/home/mengfei/my-cluster# ceph mds dump   (note: by default data_pools and metadata_pool are 0 and 1)
      dumped mdsmap epoch 201
      epoch   201
      flags   0
      created 2014-12-04 15:09:55.788256
      modified      2014-12-04 15:09:57.898040
      tableserver   0
      root    0
      session_timeout 60
      session_autoclose       300
      max_file_size   1099511627776
      last_failure    0
      last_failure_osd_epoch  0
      compat  compat={},rocompat={},incompat={1=base v0.20,2=client writeable ranges,3=default file layouts on dirs,4=dir inode in separate object,5=mds uses versioned encoding,6=dirfrag is stored in omap}
      max_mds 1
      in      0
      up      {0=7938}
      failed
      stopped
      data_pools      0
      metadata_pool   1
      inline_data   disabled
      7938:   192.168.128.101:6809/29475 'compute' mds.0.34 up:active seq 2
      7883:   192.168.128.100:6807/21690 'controller' mds.-1.0 up:standby seq 1500
      root@compute:/home/mengfei/my-cluster#

      root@compute:/home/mengfei/my-cluster# ceph mds newfs 6 5 --yes-i-really-mean-it
      new fs with metadata pool 6 and data pool 5
      root@compute:/home/mengfei/my-cluster#
      root@compute:/home/mengfei/my-cluster# ceph mds dump
      dumped mdsmap epoch 206
      epoch   206
      flags   0
      created 2014-12-04 15:15:10.430339
      modified      2014-12-04 15:15:13.558941
      tableserver   0
      root    0
      session_timeout 60
      session_autoclose       300
      max_file_size   1099511627776
      last_failure    0
      last_failure_osd_epoch  0
      compat  compat={},rocompat={},incompat={1=base v0.20,2=client writeable ranges,3=default file layouts on dirs,4=dir inode in separate object,5=mds uses versioned encoding,6=dirfrag is stored in omap}
      max_mds 1
      in      0
      up      {0=7953}
      failed
      stopped
      data_pools      5
      metadata_pool   6
      inline_data   disabled
      7953:   192.168.128.101:6810/29475 'compute' mds.0.35 up:active seq 2
      7883:   192.168.128.100:6807/21690 'controller' mds.-1.0 up:standby seq 1579
      root@compute:/home/mengfei/my-cluster#


(3) Create a secret key file
The Ceph storage cluster runs with authentication enabled by default. You should have a file containing the secret key. To obtain the key for a particular user, perform the following steps:

<1> Identify a user within the keyring file. For example:
cat ceph.client.admin.keyring

   root@compute:/home/mengfei/my-cluster# cat ceph.client.admin.keyring
   [client.admin]
             key = AQBe33ZUQBvWFBAApPAN9YAiqSFJQrTXv/TM1A==
   root@compute:/home/mengfei/my-cluster#

<2> Copy the key of the user who will be mounting the Ceph FS filesystem. It will look something like this:

   key = AQCj2YpRiAe6CxAA7/ETt7Hcl9IyxyYciVs47w==

<3> Open a text editor.

<4> Paste the key into an empty file. It should look like this:
AQCj2YpRiAe6CxAA7/ETt7Hcl9IyxyYciVs47w==

<5> Save the file, using the user name as part of the filename (e.g., /etc/ceph/admin.secret).

<6> Ensure the file permissions are appropriate for the user, but not visible to other users.
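
A shorter way to produce the same secret file (a hedged sketch; assumes the client.admin user and a node that already holds admin credentials):

ceph auth get-key client.admin | sudo tee /etc/ceph/admin.secret
sudo chmod 600 /etc/ceph/admin.secret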



(4) Kernel driver
mkdir /mnt/mycephfs

Format: mount -t ceph {ip-address-of-monitor}:6789:/ /mnt/mycephfs

The Ceph storage cluster uses authentication by default. Specify a user name and the secretfile created in the "Create a secret key file" step above. For example:
mount -t ceph 192.168.128.101:6789:/ /mnt/mycephfs -o name=admin,secretfile=/etc/ceph/admin.secret

   Note: mount the Ceph FS filesystem on the client node, not on the monitor server node.
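
A quick check that the mount succeeded (hedged; the numbers will vary with cluster usage):

df -h /mnt/mycephfs       # should report the cluster's total and available space
mount | grep mycephfs     # should show a ceph entry for the mount point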

On the client, add an entry to /etc/fstab so the filesystem mounts automatically at boot:
vi /etc/fstab
Format: {ipaddress}:{port}:/ /{mountpoint} {filesystem-name} [{mount.options}]
192.168.128.101:6789:/ /mnt/mycephfs ceph name=admin,secretfile=/etc/ceph/admin.secret,noatime 0 2

(5) Ceph filesystem in user space (FUSE)   (note: this step did not succeed in this run; it reported "ceph-fuse: command not found" — see the note after the commands)

Mount Ceph FS as a Filesystem in Userspace (FUSE):
mkdir ~/mycephfs

Format: ceph-fuse -m {ip-address-of-monitor}:6789 ~/mycephfs

The Ceph storage cluster uses authentication by default. Specify a keyring if it is not in the default location (i.e. /etc/ceph):
ceph-fuse -k ./ceph.client.admin.keyring -m 192.168.128.101:6789 ~/mycephfs
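
The "command not found" error most likely just means the ceph-fuse package is not installed on this node; installing it first should make the command available (an assumption, as this was not part of the original run; on Ubuntu the package is named ceph-fuse):

apt-get install ceph-fuse -y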
