Ceph 三节点安装配置
ceph安装配置:
由于以下环境是在装有openstack的三节点上安装的,故主机名是:controller/compute/network
但是配置ceph,可以添加mon、mds、osd、client等名称到/etc/hosts里
注:使用mon0 osd0 osd1主机名在后边创建过程中,提到和远程hostname不匹配,OSD节点id是从0开始,所以主机节点可从osd0开始
故仍使用原openstack三节点为的主机名mon0=compute osd0=controller osd1=network
(一)环境准备
1、节点IP
192.168.128.100(hostname controller,有一个分区/dev/sdc1 提供给osd)
192.168.128.102(hostname network,有一个分区/dev/sdc1 提供给osd)
192.168.128.101(hostname compute,有一个分区/dev/sdc1 提供给osd)
2、修改所有主机的/etc/hosts
#controller
192.168.128.100 controller swift1 osd0
#compute
192.168.128.101 compute mon0 osd2 client
#network
192.168.128.102 network swift2 osd1
3、在所有node上创建ceph用户(注:主机上创建了ceph用户,为方便管理,还是决定使用mengfei用户)
sudo useradd -d /home/ceph -m ceph
sudo passwd ceph
4、在每个Ceph节点中为用户增加 root 权限
echo "ceph ALL = (root) NOPASSWD:ALL" | sudo tee /etc/sudoers.d/ceph
sudo chmod 0440 /etc/sudoers.d/ceph
echo "mengfei ALL = (root) NOPASSWD:ALL" | sudo tee /etc/sudoers.d/mengfei
sudo chmod 0440 /etc/sudoers.d/mengfei
permission denied (publickey)解决方法
修改root密码后,依然拒绝root密码登录,解决方法如下:
ssh出现permission denied (publickey)问题:
修改/etc/ssh/sshd_config文件.
将其中的PermitRootLogin no修改为yes
PubkeyAuthentication yes注:不能改为no,否则在key无法使用
AuthorizedKeysFile .ssh/authorized_keys前面加上#屏蔽掉,
PasswordAuthentication no注释#掉
重启sshd即可:service sshd restart
5、配置ceph-deploy部署的无密码登录每个ceph节点
(1)在每个Ceph节点上安装一个SSH服务器
apt-get install openssh-server -y
(2)配置compute管理节点与每个Ceph节点无密码的SSH访问。(使用不同的用户,要在不同用户下建key)
root@compute:~/.ssh# ssh-keygen
Generating public/private rsa key pair.
Enter file in which to save the key (/root/.ssh/id_rsa):
Enter passphrase (empty for no passphrase):
Enter same passphrase again:
Your identification has been saved in /root/.ssh/id_rsa.
Your public key has been saved in /root/.ssh/id_rsa.pub.
The key fingerprint is:
25:6e:7c:8f:ea:4b:f1:a5:e6:f5:e7:30:fe:79:bf:08 root@compute
The key's randomart image is:
+--[ RSA 2048]----+
| |
| |
| . . |
| o o |
| S . . |
| . + = |
| . =Eo o|
| . + ..o.o+|
| .+...o*B|
+-----------------+
root@compute:~/.ssh#
(3)复制mon节点的秘钥到每个ceph节点(要指定要使用的用户名)
ssh-copy-id mengfei@compute
ssh-copy-id mengfei@controller
ssh-copy-id mengfei@network
ssh-copy-id root@compute
ssh-copy-id root@controller
ssh-copy-id root@network
root@compute:~/.ssh# ssh-copy-id mengfei@controller
/usr/bin/ssh-copy-id: INFO: attempting to log in with the new key(s), to filter out any that are already installed
/usr/bin/ssh-copy-id: INFO: 1 key(s) remain to be installed -- if you are prompted now it is to install the new keys
mengfei@controller's password:
Number of key(s) added: 1
Now try logging into the machine, with: "ssh 'mengfei@controller'"
and check to make sure that only the key(s) you wanted were added.
root@compute:~/.ssh#
(4)测试每台ceph节点不用密码是否可以登录
ssh mengfei@controller
ssh mengfei@network
ssh root@controller
ssh root@network
root@compute:~/.ssh# ssh mengfei@controller
Welcome to Ubuntu 14.04.1 LTS (GNU/Linux 3.13.0-39-generic i686)
* Documentation:https://help.ubuntu.com/
19 packages can be updated.
15 updates are security updates.
Last login: Tue Nov 25 14:07:06 2014 from compute
mengfei@controller:~$
(5)(Recommended) Modify the ~/.ssh/config file of your ceph-deploy admin node
so that ceph-deploy can log in to Ceph nodes as the user you created without requiring you to specify --username {username} each time you execute ceph-deploy. This has the added benefit of streamlining ssh and scp usage. Replace {username} with the user name you created:
注:实例中没有添加,config默认貌似没有这个文件。
Host controller
Hostname controller
User mengfei
Host node2
Hostname network
User mengfei
(二)安装ceph-deploy
1、添加release key
wget -q -O- 'https://ceph.com/git/?p=ceph.git;a=blob_plain;f=keys/release.asc' | sudo apt-key add -
2、添加Ceph包到你的仓库,用一个稳定的Ceph发行版替换{ceph-stable-release}(如 cuttlefish, dumpling等)
实例:echo deb http://ceph.com/debian-{ceph-stable-release}/ $(lsb_release -sc) main | sudo tee /etc/apt/sources.list.d/ceph.list
echo deb http://ceph.com/debian-dumpling/ $(lsb_release -sc) main | sudo tee /etc/apt/sources.list.d/ceph.list
3、更新源,并安装ceph-deploy
apt-get update
apt-get install ceph-deploy
4、安装ntp (省略)
(三)安装配置ceph cluter
1、为了获得最佳效果,你的admin维护配置您的群集节点上创建一个目录。
mkdir my-cluster
mkdir /etc/ceph
cd my-cluster
2、创建一个集群
(1)要创建您的Ceph的存储集群,生成一个文件系统ID(FSID),在命令行提示符下输入以下命令,生成监视器的秘钥
ceph-deploy purgedata compute controller network
ceph-deploy forgetkeys
root@compute:/home/mengfei/my-cluster# ceph-deploy purgedata compute controller network
found configuration file at: /root/.cephdeploy.conf
Invoked (1.5.20): /usr/bin/ceph-deploy purgedata compute controller network
Purging data from cluster ceph hosts compute controller network
connected to host: compute
detect platform information from remote host
detect machine type
find the location of an executable
connected to host: controller
detect platform information from remote host
detect machine type
find the location of an executable
connected to host: network
detect platform information from remote host
detect machine type
find the location of an executable
connected to host: compute
detect platform information from remote host
detect machine type
Distro info: Ubuntu 14.04 trusty
purging data on compute
Running command: rm -rf --one-file-system -- /var/lib/ceph
Running command: rm -rf --one-file-system -- /etc/ceph/
connected to host: controller
detect platform information from remote host
detect machine type
Distro info: Ubuntu 14.04 trusty
purging data on controller
Running command: rm -rf --one-file-system -- /var/lib/ceph
Running command: rm -rf --one-file-system -- /etc/ceph/
connected to host: network
detect platform information from remote host
detect machine type
Distro info: Ubuntu 14.04 trusty
purging data on network
Running command: rm -rf --one-file-system -- /var/lib/ceph
Running command: rm -rf --one-file-system -- /etc/ceph/
root@compute:/home/mengfei/my-cluster#
root@compute:/home/mengfei/my-cluster# ceph-deploy forgetkeys
found configuration file at: /root/.cephdeploy.conf
Invoked (1.5.20): /usr/bin/ceph-deploy forgetkeys
root@compute:/home/mengfei/my-cluster#
(2)在管理模式下,请使用ceph-deploy创建集群
注:当前目录下会生成ceph.conf ceph.mon.keyring ceph.log 配置文件,密钥环,日志文件
cd /home/mengfei/my-cluster
ceph-deploy new compute
(注:应该先在主节点上先创建集群ceph,应该先new compute创建,后边再install)
root@compute:/home/mengfei/my-cluster# ceph-deploy new compute
found configuration file at: /root/.cephdeploy.conf
Invoked (1.5.20): /usr/bin/ceph-deploy new compute
Creating new cluster named ceph
making sure passwordless SSH succeeds
connected to host: compute
detect platform information from remote host
detect machine type
find the location of an executable
Running command: /bin/ip link show
Running command: /bin/ip addr show
IP addresses found: ['192.168.122.1', '192.168.128.101', '10.10.10.101']
Resolving host compute
Monitor compute at 192.168.128.101
Monitor initial members are ['compute']
Monitor addrs are ['192.168.128.101']
Creating a random mon key...
Writing monitor keyring to ceph.mon.keyring...
Writing initial config to ceph.conf...
root@compute:/home/mengfei/my-cluster#
(3)安装Ceph
ceph-deploy install compute controller network
ceph-deploy uninstall compute controller network 如果需要重装,可以此两条命令删除ceph
apt-get remove --purge ceph ceph-common ceph-mds
root@compute:/home/mengfei/my-cluster# ceph-deploy install compute controller network
found configuration file at: /root/.cephdeploy.conf
Invoked (1.5.20): /usr/bin/ceph-deploy install compute controller network
Installing stable version firefly on cluster ceph hosts compute controller network
Detecting platform for host compute ...
connected to host: compute
detect platform information from remote host
detect machine type
Distro info: Ubuntu 14.04 trusty
installing ceph on compute
Running command: env DEBIAN_FRONTEND=noninteractive apt-get -q install --assume-yes ca-certificates
Reading package lists...
Building dependency tree...
Reading state information...
ca-certificates is already the newest version.
0 upgraded, 0 newly installed, 0 to remove and 48 not upgraded.
Running command: wget -O release.asc https://ceph.com/git/?p=ceph.git;a=blob_plain;f=keys/release.asc
--2014-11-26 17:36:27--https://ceph.com/git/?p=ceph.git;a=blob_plain;f=keys/release.asc
Resolving ceph.com (ceph.com)... 208.113.241.137, 2607:f298:4:147::b05:fe2a
Connecting to ceph.com (ceph.com)|208.113.241.137|:443... connected.
HTTP request sent, awaiting response... 200 OK
Length: unspecified
Saving to: ‘release.asc’
0K . 19.7M=0s
2014-11-26 17:36:28 (19.7 MB/s) - ‘release.asc’ saved
Running command: apt-key add release.asc
OK
add deb repo to sources.list
Running command: apt-get -q update
Ign http://cn.archive.ubuntu.com trusty InRelease
Hit http://ceph.com trusty InRelease
Ign http://cn.archive.ubuntu.com trusty-updates InRelease
Ign http://ubuntu-cloud.archive.canonical.com trusty-updates/juno InRelease
Ign http://cn.archive.ubuntu.com trusty-backports InRelease
Hit http://ceph.com trusty/main i386 Packages
Hit http://ubuntu-cloud.archive.canonical.com trusty-updates/juno Release.gpg
Ign http://cn.archive.ubuntu.com trusty-security InRelease
Hit http://ubuntu-cloud.archive.canonical.com trusty-updates/juno Release
..................一系列包的源网址,太长了,就省略显示了。。
Ign http://cn.archive.ubuntu.com trusty/universe Translation-en_US
Reading package lists...
Running command: env DEBIAN_FRONTEND=noninteractive DEBIAN_PRIORITY=critical apt-get -q -o Dpkg::Options::=--force-confnew --no-install-recommends --assume-yes install -- ceph ceph-mds ceph-common ceph-fs-common gdisk
Reading package lists...
Building dependency tree...
Reading state information...
gdisk is already the newest version.
ceph is already the newest version.
ceph-common is already the newest version.
ceph-fs-common is already the newest version.
ceph-mds is already the newest version.
0 upgraded, 0 newly installed, 0 to remove and 48 not upgraded.
Running command: ceph --version
ceph version 0.80.7 (6c0127fcb58008793d3c8b62d925bc91963672a3)
Detecting platform for host controller ...
connected to host: controller
detect platform information from remote host
detect machine type
Distro info: Ubuntu 14.04 trusty
installing ceph on controller
Running command: env DEBIAN_FRONTEND=noninteractive apt-get -q install --assume-yes ca-certificates
Reading package lists...
Building dependency tree...
Reading state information...
ca-certificates is already the newest version.
0 upgraded, 0 newly installed, 0 to remove and 17 not upgraded.
Running command: wget -O release.asc https://ceph.com/git/?p=ceph.git;a=blob_plain;f=keys/release.asc
--2014-11-26 17:37:17--https://ceph.com/git/?p=ceph.git;a=blob_plain;f=keys/release.asc
Resolving ceph.com (ceph.com)... 208.113.241.137, 2607:f298:4:147::b05:fe2a
Connecting to ceph.com (ceph.com)|208.113.241.137|:443... connected.
HTTP request sent, awaiting response... 200 OK
Length: unspecified
Saving to: ‘release.asc’
0K . 34.4M=0s
2014-11-26 17:37:19 (34.4 MB/s) - ‘release.asc’ saved
Running command: apt-key add release.asc
OK
add deb repo to sources.list
Running command: apt-get -q update
Hit http://ceph.com trusty InRelease
Ign http://downloads-distro.mongodb.org dist InRelease
..................一系列包的源网址,太长了,就省略显示了。。
Ign http://cn.archive.ubuntu.com trusty/restricted Translation-en_US
Ign http://cn.archive.ubuntu.com trusty/universe Translation-en_US
Fetched 361 kB in 27s (13.1 kB/s)
Reading package lists...
W: GPG error: http://downloads-distro.mongodb.org dist Release: The following signatures couldn't be verified because the public key is not available: NO_PUBKEY 9ECBEC467F0CEB10
Running command: env DEBIAN_FRONTEND=noninteractive DEBIAN_PRIORITY=critical apt-get -q -o Dpkg::Options::=--force-confnew --no-install-recommends --assume-yes install -- ceph ceph-mds ceph-common ceph-fs-common gdisk
Reading package lists...
Building dependency tree...
Reading state information...
gdisk is already the newest version.
ceph is already the newest version.
ceph-common is already the newest version.
ceph-fs-common is already the newest version.
ceph-mds is already the newest version.
0 upgraded, 0 newly installed, 0 to remove and 17 not upgraded.
Running command: ceph --version
ceph version 0.80.7 (6c0127fcb58008793d3c8b62d925bc91963672a3)
Detecting platform for host network ...
connected to host: network
detect platform information from remote host
detect machine type
Distro info: Ubuntu 14.04 trusty
installing ceph on network
Running command: env DEBIAN_FRONTEND=noninteractive apt-get -q install --assume-yes ca-certificates
Reading package lists...
Building dependency tree...
Reading state information...
ca-certificates is already the newest version.
0 upgraded, 0 newly installed, 0 to remove and 36 not upgraded.
Running command: wget -O release.asc https://ceph.com/git/?p=ceph.git;a=blob_plain;f=keys/release.asc
--2014-11-26 17:37:52--https://ceph.com/git/?p=ceph.git;a=blob_plain;f=keys/release.asc
Resolving ceph.com (ceph.com)... 208.113.241.137, 2607:f298:4:147::b05:fe2a
Connecting to ceph.com (ceph.com)|208.113.241.137|:443... connected.
HTTP request sent, awaiting response... 200 OK
Length: unspecified
Saving to: ‘release.asc’
0K . 16.1M=0s
2014-11-26 17:38:02 (16.1 MB/s) - ‘release.asc’ saved
Running command: apt-key add release.asc
OK
add deb repo to sources.list
Running command: apt-get -q update
Hit http://ceph.com trusty InRelease
Ign http://security.ubuntu.com trusty-security InRelease
..................一系列包的源网址,太长了,就省略显示了。。
Ign http://cn.archive.ubuntu.com trusty/restricted Translation-en_US
Ign http://cn.archive.ubuntu.com trusty/universe Translation-en_US
Fetched 361 kB in 41s (8,704 B/s)
Reading package lists...
W: Duplicate sources.list entry http://cn.archive.ubuntu.com/ubuntu/ trusty/main i386 Packages (/var/lib/apt/lists/cn.archive.ubuntu.com_ubuntu_dists_trusty_main_binary-i386_Packages)
Running command: env DEBIAN_FRONTEND=noninteractive DEBIAN_PRIORITY=critical apt-get -q -o Dpkg::Options::=--force-confnew --no-install-recommends --assume-yes install -- ceph ceph-mds ceph-common ceph-fs-common gdisk
Reading package lists...
Building dependency tree...
Reading state information...
gdisk is already the newest version.
ceph is already the newest version.
ceph-common is already the newest version.
ceph-fs-common is already the newest version.
ceph-mds is already the newest version.
0 upgraded, 0 newly installed, 0 to remove and 36 not upgraded.
Running command: ceph --version
ceph version 0.80.7 (6c0127fcb58008793d3c8b62d925bc91963672a3)
root@compute:/home/mengfei/my-cluster#
(4)增加一个Ceph集群监视器
ceph-deploy mon create compute
root@compute:/home/mengfei/my-cluster# ceph-deploy mon create compute
found configuration file at: /root/.cephdeploy.conf
Invoked (1.5.20): /usr/bin/ceph-deploy mon create compute
keyring (ceph.mon.keyring) not found, creating a new one
Creating a random mon key...
Writing monitor keyring to ceph.mon.keyring...
Deploying mon, cluster ceph hosts compute
detecting platform for host compute ...
connected to host: compute
detect platform information from remote host
detect machine type
distro info: Ubuntu 14.04 trusty
determining if provided host has same hostname in remote
get remote short hostname
deploying mon to compute
get remote short hostname
remote hostname: compute
write cluster configuration to /etc/ceph/{cluster}.conf
create the mon path if it does not exist
checking for done path: /var/lib/ceph/mon/ceph-compute/done
done path does not exist: /var/lib/ceph/mon/ceph-compute/done
creating keyring file: /var/lib/ceph/tmp/ceph-compute.mon.keyring
create the monitor keyring file
Running command: ceph-mon --cluster ceph --mkfs -i compute --keyring /var/lib/ceph/tmp/ceph-compute.mon.keyring
ceph-mon: mon.noname-a 192.168.128.101:6789/0 is local, renaming to mon.compute
ceph-mon: set fsid to a15c8476-cd50-4609-bfc7-bc49a5d24f8c
ceph-mon: created monfs at /var/lib/ceph/mon/ceph-compute for mon.compute
unlinking keyring file /var/lib/ceph/tmp/ceph-compute.mon.keyring
create a done file to avoid re-doing the mon deployment
create the init path if it does not exist
locating the `service` executable...
Running command: initctl emit ceph-mon cluster=ceph id=compute
Running command: ceph --cluster=ceph --admin-daemon /var/run/ceph/ceph-mon.compute.asok mon_status
********************************************************************************
status for monitor: mon.compute
{
"election_epoch": 2,
"extra_probe_peers": [],
"monmap": {
"created": "0.000000",
"epoch": 1,
"fsid": "a15c8476-cd50-4609-bfc7-bc49a5d24f8c",
"modified": "0.000000",
"mons": [
{
"addr": "192.168.128.101:6789/0",
"name": "compute",
"rank": 0
}
]
},
"name": "compute",
"outside_quorum": [],
"quorum": [
0
],
"rank": 0,
"state": "leader",
"sync_provider": []
}
********************************************************************************
monitor: mon.compute is running
Running command: ceph --cluster=ceph --admin-daemon /var/run/ceph/ceph-mon.compute.asok mon_status
root@compute:/home/mengfei/my-cluster#
(5)收集密钥
ceph-deploy gatherkeys compute
一旦你收集到密钥,在本地目录下可看到如下密钥环文件:
1. {cluster-name}.client.admin.keyring
2. {cluster-name}.bootstrap-osd.keyring
3. {cluster-name}.bootstrap-mds.keyring
root@compute:/home/mengfei/my-cluster# ceph-deploy gatherkeys compute
found configuration file at: /root/.cephdeploy.conf
Invoked (1.5.20): /usr/bin/ceph-deploy gatherkeys compute
Checking compute for /etc/ceph/ceph.client.admin.keyring
connected to host: compute
detect platform information from remote host
detect machine type
fetch remote file
Got ceph.client.admin.keyring key from compute.
Have ceph.mon.keyring
Checking compute for /var/lib/ceph/bootstrap-osd/ceph.keyring
connected to host: compute
detect platform information from remote host
detect machine type
fetch remote file
Got ceph.bootstrap-osd.keyring key from compute.
Checking compute for /var/lib/ceph/bootstrap-mds/ceph.keyring
connected to host: compute
detect platform information from remote host
detect machine type
fetch remote file
Got ceph.bootstrap-mds.keyring key from compute.
root@compute:/home/mengfei/my-cluster#
(6)创建osd目录挂载点
注:disk是5G,这里只划出1G,剩余空间暂时留作它用。
ssh root@controller 也就是osd0
创建磁盘分区
fdisk /dev/sdc 注:下边有输出记录
创建挂载点
mkdir -p /var/lib/ceph/osd/ceph-osd0
格式化分区:荐用xfs或btrfs文件系统,命令是mkfs
mkfs.xfs -f /dev/sdc1
mount /dev/sdc1 /var/lib/ceph/osd/ceph-osd0 注:加-o user_xattr 报错,提示bad option
mount -o remount,user_xattr /var/lib/ceph/osd/ceph-osd0 注:文件系统上添加user_xattr选项,remount不需要完全卸载文件系统
vi /etc/fstab
/dev/sdc1 /var/lib/ceph/osd/ceph-osd0 xfs defaults 0 0 注:自已添加,官方文档没此步骤
/dev/sdc1 /var/lib/ceph/osd/ceph-osd0 xfs remount,user_xattr 0 0
root@controller:/home/mengfei#fdisk /dev/sdc
Command (m for help): n
Partition type:
p primary (0 primary, 0 extended, 4 free)
e extended
Select (default p): p
Partition number (1-4, default 1): 1
First sector (2048-10485759, default 2048):
Using default value 2048
Last sector, +sectors or +size{K,M,G} (2048-10485759, default 10485759): 2097151
Command (m for help): p
Device Boot Start End Blocks IdSystem
/dev/sdc1 2048 2097151 1047552 83Linux
Command (m for help): w
The partition table has been altered!
Calling ioctl() to re-read partition table.
Syncing disks.
root@controller:/home/mengfei#
root@controller:/home/mengfei# mkfs.xfs -f /dev/sdc1
meta-data=/dev/sdc1 isize=256 agcount=4, agsize=65472 blks
= sectsz=512 attr=2, projid32bit=0
data = bsize=4096 blocks=261888, imaxpct=25
= sunit=0 swidth=0 blks
naming =version 2 bsize=4096 ascii-ci=0
log =internal log bsize=4096 blocks=1200, version=2
= sectsz=512 sunit=0 blks, lazy-count=1
realtime =none extsz=4096 blocks=0, rtextents=0
root@controller:/home/mengfei#
ssh root@network 也就是osd1
创建磁盘分区
fdisk /dev/sdc 注:下边有输出记录
创建挂载点
mkdir -p /var/lib/ceph/osd/ceph-osd1
格式化分区:荐用xfs或btrfs文件系统,命令是mkfs
mkfs.xfs -f /dev/sdc1
mount /dev/sdc1 /var/lib/ceph/osd/ceph-osd1 注:加-o user_xattr 报错,提示bad option
mount -o remount,user_xattr /var/lib/ceph/osd/ceph-osd1 注:文件系统上添加user_xattr选项,remount不需要完全卸载文件系统
vi /etc/fstab
#/dev/sdc1 /var/lib/ceph/osd/ceph-osd1 xfs user_xattr 0 0 注:自已添加,官方文档没此步骤
/dev/sdc1 /var/lib/ceph/osd/ceph-osd1 xfs rw 0 0
root@controller:/home/mengfei#fdisk /dev/sdc
Command (m for help): n
Partition type:
p primary (0 primary, 0 extended, 4 free)
e extended
Select (default p): p
Partition number (1-4, default 1): 1
First sector (2048-10485759, default 2048):
Using default value 2048
Last sector, +sectors or +size{K,M,G} (2048-10485759, default 10485759): 2097151
Command (m for help): p
Disk /dev/sdc: 5368 MB, 5368709120 bytes
255 heads, 63 sectors/track, 652 cylinders, total 10485760 sectors
Units = sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes
Disk identifier: 0xfbd1ab98
Device Boot Start End Blocks IdSystem
/dev/sdc1 2048 2097151 1047552 83Linux
Command (m for help): w
The partition table has been altered!
Calling ioctl() to re-read partition table.
Syncing disks.
root@network:/home/mengfei#
root@network:/home/mengfei# mkfs.xfs -f /dev/sdc1
meta-data=/dev/sdc1 isize=256 agcount=4, agsize=65472 blks
= sectsz=512 attr=2, projid32bit=0
data = bsize=4096 blocks=261888, imaxpct=25
= sunit=0 swidth=0 blks
naming =version 2 bsize=4096 ascii-ci=0
log =internal log bsize=4096 blocks=1200, version=2
= sectsz=512 sunit=0 blks, lazy-count=1
realtime =none extsz=4096 blocks=0, rtextents=0
root@network:/home/mengfei#
(7)管理模式下添加OSD节点并激活OSD
cd /home/mengfei/my-cluster
注:一定要到此目录下执行,因为创建集群ceph时会自动在此目录下生成ceph.conf.运行ceph-deploy时会自动分发,不在此目录下执行会提示“Cannot load config”
有些配置是需要在my-cluster/ceph.conf修改的,比如:ceph-osd0/journal 默认可能需要很大,所以我就在my-cluter/ceph.conf做了修改:
osd journal size = 100 journal大小100M,如果mount点够大,快速安装就无所谓了,我的空间小,就设定了100
osd pool default size = 3 (配置存储对象副本数=对象+副本)
osd pool default min_size = 1(配置存储对象最小副本数)
osd crush chooseleaf type = 1(使用在CRUSH规则chooseleaf斗式。使用序号名称而非军衔,默认是1)
ceph-deploy osd prepare controller:/var/lib/ceph/osd/ceph-osd0
ceph-deploy osd prepare network:/var/lib/ceph/osd/ceph-osd1
ceph-deploy osd activate controller:/var/lib/ceph/osd/ceph-osd0
ceph-deploy osd activate network:/var/lib/ceph/osd/ceph-osd1
注:有时执行时会提示--overwirte-conf
root@compute:/home/mengfei/my-cluster# ceph-deploy osd prepare controller:/var/lib/ceph/osd/ceph-osd0
found configuration file at: /root/.cephdeploy.conf
Invoked (1.5.20): /usr/bin/ceph-deploy --overwrite-conf osd prepare controller:/var/lib/ceph/osd/ceph-osd0
Preparing cluster ceph disks controller:/var/lib/ceph/osd/ceph-osd0:
connected to host: controller
detect platform information from remote host
detect machine type
Distro info: Ubuntu 14.04 trusty
Deploying osd to controller
write cluster configuration to /etc/ceph/{cluster}.conf
Running command: udevadm trigger --subsystem-match=block --action=add
Preparing host controller disk /var/lib/ceph/osd/ceph-osd0 journal None activate False
Running command: ceph-disk -v prepare --fs-type xfs --cluster ceph -- /var/lib/ceph/osd/ceph-osd0
INFO:ceph-disk:Running command: /usr/bin/ceph-osd --cluster=ceph --show-config-value=fsid
INFO:ceph-disk:Running command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup osd_mkfs_options_xfs
INFO:ceph-disk:Running command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup osd_fs_mkfs_options_xfs
INFO:ceph-disk:Running command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup osd_mount_options_xfs
INFO:ceph-disk:Running command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup osd_fs_mount_options_xfs
INFO:ceph-disk:Running command: /usr/bin/ceph-osd --cluster=ceph --show-config-value=osd_journal_size
DEBUG:ceph-disk:Preparing osd data dir /var/lib/ceph/osd/ceph-osd0
checking OSD status...
Running command: ceph --cluster=ceph osd stat --format=json
Host controller is now ready for osd use.
root@compute:/home/mengfei/my-cluster#
root@compute:/home/mengfei/my-cluster# ceph-deploy osd activate controller:/var/lib/ceph/osd/ceph-osd0
found configuration file at: /root/.cephdeploy.conf
Invoked (1.5.20): /usr/bin/ceph-deploy osd activate controller:/var/lib/ceph/osd/ceph-osd0
Activating cluster ceph disks controller:/var/lib/ceph/osd/ceph-osd0:
connected to host: controller
detect platform information from remote host
detect machine type
Distro info: Ubuntu 14.04 trusty
activating host controller disk /var/lib/ceph/osd/ceph-osd0
will use init type: upstart
Running command: ceph-disk -v activate --mark-init upstart --mount /var/lib/ceph/osd/ceph-osd0
DEBUG:ceph-disk:Cluster uuid is a15c8476-cd50-4609-bfc7-bc49a5d24f8c
INFO:ceph-disk:Running command: /usr/bin/ceph-osd --cluster=ceph --show-config-value=fsid
DEBUG:ceph-disk:Cluster name is ceph
DEBUG:ceph-disk:OSD uuid is ec6d0ec3-9c44-4bec-80cb-24709ec03ea1
DEBUG:ceph-disk:Allocating OSD id...
INFO:ceph-disk:Running command: /usr/bin/ceph --cluster ceph --name client.bootstrap-osd --keyring /var/lib/ceph/bootstrap-osd/ceph.keyring osd create --concise ec6d0ec3-9c44-4bec-80cb-24709ec03ea1
DEBUG:ceph-disk:OSD id is 4
DEBUG:ceph-disk:Initializing OSD...
INFO:ceph-disk:Running command: /usr/bin/ceph --cluster ceph --name client.bootstrap-osd --keyring /var/lib/ceph/bootstrap-osd/ceph.keyring mon getmap -o /var/lib/ceph/osd/ceph-osd0/activate.monmap
got monmap epoch 1
INFO:ceph-disk:Running command: /usr/bin/ceph-osd --cluster ceph --mkfs --mkkey -i 4 --monmap /var/lib/ceph/osd/ceph-osd0/activate.monmap --osd-data /var/lib/ceph/osd/ceph-osd0 --osd-journal /var/lib/ceph/osd/ceph-osd0/journal --osd-uuid ec6d0ec3-9c44-4bec-80cb-24709ec03ea1 --keyring /var/lib/ceph/osd/ceph-osd0/keyring
2014-11-27 15:23:41.011243 b684b740 -1 journal FileJournal::_open: disabling aio for non-block journal.Use journal_force_aio to force use of aio anyway
2014-11-27 15:23:41.261893 b684b740 -1 journal FileJournal::_open: disabling aio for non-block journal.Use journal_force_aio to force use of aio anyway
2014-11-27 15:23:41.291819 b684b740 -1 filestore(/var/lib/ceph/osd/ceph-osd0) could not find 23c2fcde/osd_superblock/0//-1 in index: (2) No such file or directory
2014-11-27 15:23:41.300771 b684b740 -1 created object store /var/lib/ceph/osd/ceph-osd0 journal /var/lib/ceph/osd/ceph-osd0/journal for osd.4 fsid a15c8476-cd50-4609-bfc7-bc49a5d24f8c
2014-11-27 15:23:41.300858 b684b740 -1 auth: error reading file: /var/lib/ceph/osd/ceph-osd0/keyring: can't open /var/lib/ceph/osd/ceph-osd0/keyring: (2) No such file or directory
2014-11-27 15:23:41.301001 b684b740 -1 created new key in keyring /var/lib/ceph/osd/ceph-osd0/keyring
DEBUG:ceph-disk:Marking with init system upstart
DEBUG:ceph-disk:Authorizing OSD key...
INFO:ceph-disk:Running command: /usr/bin/ceph --cluster ceph --name client.bootstrap-osd --keyring /var/lib/ceph/bootstrap-osd/ceph.keyring auth add osd.4 -i /var/lib/ceph/osd/ceph-osd0/keyring osd allow * mon allow profile osd
added key for osd.4
DEBUG:ceph-disk:ceph osd.4 data dir is ready at /var/lib/ceph/osd/ceph-osd0
DEBUG:ceph-disk:Creating symlink /var/lib/ceph/osd/ceph-4 -> /var/lib/ceph/osd/ceph-osd0
DEBUG:ceph-disk:Starting ceph osd.4...
INFO:ceph-disk:Running command: /sbin/initctl emit --no-wait -- ceph-osd cluster=ceph id=4
checking OSD status...
Running command: ceph --cluster=ceph osd stat --format=json
root@compute:/home/mengfei/my-cluster#
root@compute:/home/mengfei/my-cluster# ceph-deploy osd prepare network:/var/lib/ceph/osd/ceph-osd1
found configuration file at: /root/.cephdeploy.conf
Invoked (1.5.20): /usr/bin/ceph-deploy osd prepare network:/var/lib/ceph/osd/ceph-osd1
Preparing cluster ceph disks network:/var/lib/ceph/osd/ceph-osd1:
connected to host: network
detect platform information from remote host
detect machine type
Distro info: Ubuntu 14.04 trusty
Deploying osd to network
write cluster configuration to /etc/ceph/{cluster}.conf
osd keyring does not exist yet, creating one
create a keyring file
Running command: udevadm trigger --subsystem-match=block --action=add
Preparing host network disk /var/lib/ceph/osd/ceph-osd1 journal None activate False
Running command: ceph-disk -v prepare --fs-type xfs --cluster ceph -- /var/lib/ceph/osd/ceph-osd1
INFO:ceph-disk:Running command: /usr/bin/ceph-osd --cluster=ceph --show-config-value=fsid
INFO:ceph-disk:Running command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup osd_mkfs_options_xfs
INFO:ceph-disk:Running command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup osd_fs_mkfs_options_xfs
INFO:ceph-disk:Running command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup osd_mount_options_xfs
INFO:ceph-disk:Running command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup osd_fs_mount_options_xfs
INFO:ceph-disk:Running command: /usr/bin/ceph-osd --cluster=ceph --show-config-value=osd_journal_size
DEBUG:ceph-disk:Preparing osd data dir /var/lib/ceph/osd/ceph-osd1
checking OSD status...
Running command: ceph --cluster=ceph osd stat --format=json
Host network is now ready for osd use.
root@compute:/home/mengfei/my-cluster#
root@compute:/home/mengfei/my-cluster# ceph-deploy osd activate network:/var/lib/ceph/osd/ceph-osd1
found configuration file at: /root/.cephdeploy.conf
Invoked (1.5.20): /usr/bin/ceph-deploy osd activate network:/var/lib/ceph/osd/ceph-osd1
Activating cluster ceph disks network:/var/lib/ceph/osd/ceph-osd1:
connected to host: network
detect platform information from remote host
detect machine type
Distro info: Ubuntu 14.04 trusty
activating host network disk /var/lib/ceph/osd/ceph-osd1
will use init type: upstart
Running command: ceph-disk -v activate --mark-init upstart --mount /var/lib/ceph/osd/ceph-osd1
DEBUG:ceph-disk:Cluster uuid is 8b2af1e6-92eb-4d74-9ca5-057522bb738f
INFO:ceph-disk:Running command: /usr/bin/ceph-osd --cluster=ceph --show-config-value=fsid
DEBUG:ceph-disk:Cluster name is ceph
DEBUG:ceph-disk:OSD uuid is c8b2811c-fb19-49c3-b630-374a4db7073e
DEBUG:ceph-disk:Allocating OSD id...
INFO:ceph-disk:Running command: /usr/bin/ceph --cluster ceph --name client.bootstrap-osd --keyring /var/lib/ceph/bootstrap-osd/ceph.keyring osd create --concise c8b2811c-fb19-49c3-b630-374a4db7073e
DEBUG:ceph-disk:OSD id is 1
DEBUG:ceph-disk:Initializing OSD...
INFO:ceph-disk:Running command: /usr/bin/ceph --cluster ceph --name client.bootstrap-osd --keyring /var/lib/ceph/bootstrap-osd/ceph.keyring mon getmap -o /var/lib/ceph/osd/ceph-osd1/activate.monmap
got monmap epoch 1
INFO:ceph-disk:Running command: /usr/bin/ceph-osd --cluster ceph --mkfs --mkkey -i 1 --monmap /var/lib/ceph/osd/ceph-osd1/activate.monmap --osd-data /var/lib/ceph/osd/ceph-osd1 --osd-journal /var/lib/ceph/osd/ceph-osd1/journal --osd-uuid c8b2811c-fb19-49c3-b630-374a4db7073e --keyring /var/lib/ceph/osd/ceph-osd1/keyring
2014-11-27 16:27:23.448198 b67e2740 -1 journal FileJournal::_open: disabling aio for non-block journal.Use journal_force_aio to force use of aio anyway
2014-11-27 16:27:23.824770 b67e2740 -1 journal FileJournal::_open: disabling aio for non-block journal.Use journal_force_aio to force use of aio anyway
2014-11-27 16:27:23.865648 b67e2740 -1 filestore(/var/lib/ceph/osd/ceph-osd1) could not find 23c2fcde/osd_superblock/0//-1 in index: (2) No such file or directory
2014-11-27 16:27:23.885991 b67e2740 -1 created object store /var/lib/ceph/osd/ceph-osd1 journal /var/lib/ceph/osd/ceph-osd1/journal for osd.1 fsid 8b2af1e6-92eb-4d74-9ca5-057522bb738f
2014-11-27 16:27:23.887571 b67e2740 -1 auth: error reading file: /var/lib/ceph/osd/ceph-osd1/keyring: can't open /var/lib/ceph/osd/ceph-osd1/keyring: (2) No such file or directory
2014-11-27 16:27:23.890124 b67e2740 -1 created new key in keyring /var/lib/ceph/osd/ceph-osd1/keyring
DEBUG:ceph-disk:Marking with init system upstart
DEBUG:ceph-disk:Authorizing OSD key...
INFO:ceph-disk:Running command: /usr/bin/ceph --cluster ceph --name client.bootstrap-osd --keyring /var/lib/ceph/bootstrap-osd/ceph.keyring auth add osd.1 -i /var/lib/ceph/osd/ceph-osd1/keyring osd allow * mon allow profile osd
added key for osd.1
DEBUG:ceph-disk:ceph osd.1 data dir is ready at /var/lib/ceph/osd/ceph-osd1
DEBUG:ceph-disk:Creating symlink /var/lib/ceph/osd/ceph-1 -> /var/lib/ceph/osd/ceph-osd1
DEBUG:ceph-disk:Starting ceph osd.1...
INFO:ceph-disk:Running command: /sbin/initctl emit --no-wait -- ceph-osd cluster=ceph id=1
checking OSD status...
Running command: ceph --cluster=ceph osd stat --format=json
root@compute:/home/mengfei/my-cluster#
(8)复制配置文件和管理密钥到管理节点和你的Ceph节点
注:使用ceph-deploy命令将配置文件和管理密钥复制到管理节点和你的Ceph节点。
下次你再使用ceph命令界面时就无需指定集群监视器地址,执行命令时也无需每次都指定ceph.client.admin.keyring
ceph-deploy admin compute controller network (注:有时提示需要--overwrite-conf,实例中需要指定)
root@compute:/home/mengfei/my-cluster# ceph-deploy admin compute controller network
found configuration file at: /root/.cephdeploy.conf
Invoked (1.5.20): /usr/bin/ceph-deploy --overwrite-conf admin compute controller network
Pushing admin keys and conf to compute
connected to host: compute
detect platform information from remote host
detect machine type
get remote short hostname
write cluster configuration to /etc/ceph/{cluster}.conf
Pushing admin keys and conf to controller
connected to host: controller
detect platform information from remote host
detect machine type
get remote short hostname
write cluster configuration to /etc/ceph/{cluster}.conf
Pushing admin keys and conf to network
connected to host: network
detect platform information from remote host
detect machine type
get remote short hostname
write cluster configuration to /etc/ceph/{cluster}.conf
root@compute:/home/mengfei/my-cluster#
root@compute:/home/mengfei/my-cluster#
root@compute:/home/mengfei/my-cluster#
(9)验证osd
ceph osd tree 查看状态
ceph osd dump 查看osd配置信息
ceph osd rm 删除节点 remove osd(s) <id> [<id>...]
ceph osd crush rm osd.0 在集群中删除一个osd 硬盘 crush map
ceph osd crush rm node1 在集群中删除一个osd的host节点
root@compute:/home/mengfei/my-cluster# ceph osd tree (weight默认是0)
# id weighttype name up/down reweight
-1 0 root default
-2 0 host controller
0 0 osd.0 up 1
-3 0 host network
1 0 osd.1 up 1
root@compute:/home/mengfei/my-cluster#
root@compute:/var/log/ceph# ceph osd dump
epoch 89
fsid 8b2af1e6-92eb-4d74-9ca5-057522bb738f
created 2014-11-27 16:22:54.085639
modified 2014-11-28 23:39:44.056533
flags
pool 0 'data' replicated size 3 min_size 1 crush_ruleset 0 object_hash rjenkins pg_num 64 pgp_num 64 last_change 89 flags hashpspool crash_replay_interval 45 stripe_width 0
pool 1 'metadata' replicated size 3 min_size 1 crush_ruleset 0 object_hash rjenkins pg_num 64 pgp_num 64 last_change 88 flags hashpspool stripe_width 0
pool 2 'rbd' replicated size 3 min_size 1 crush_ruleset 0 object_hash rjenkins pg_num 64 pgp_num 64 last_change 87 flags hashpspool stripe_width 0
max_osd 2
osd.0 up inweight 0 up_from 32 up_thru 82 down_at 31 last_clean_interval [15,29) 192.168.128.100:6800/2811 192.168.128.100:6801/2811 192.168.128.100:6802/2811 192.168.128.100:6803/2811 exists,up f4707c04-aeca-46fe-bf0e-f7e2d43d0524
osd.1 up inweight 0 up_from 33 up_thru 82 down_at 29 last_clean_interval [14,28) 192.168.128.102:6800/3105 192.168.128.102:6801/3105 192.168.128.102:6802/3105 192.168.128.102:6803/3105 exists,up c8b2811c-fb19-49c3-b630-374a4db7073e
root@compute:/var/log/ceph#
接上。。。。
(三)扩展集群
compute(mon0):增加一个osd进程osd2 和一个元数据服务器mds0
controller(osd0):增加一个监视器服务器mon1
network(osd1):增加一个监视器服务器mon2
注:多个监视器服务器可以生成quoraum
1. 在compute上增加OSD节点
(1)compute节点创建osd2目录
ssh compute
mkdir -p /var/lib/ceph/osd/ceph-osd2
fdisk /dev/sdc
mkfs.rfs -f /dev/sdc1
mount/dev/sdc1 /var/lib/ceph/osd/ceph-osd2
mount -o remount,user_xattr/dev/sdc1 /var/lib/ceph/osd/ceph-osd2
vi /etc/fstab
/dev/sdc1 /var/lib/ceph/osd/ceph-osd2 xfs defaults 0 0
/dev/sdc1 /var/lib/ceph/osd/ceph-osd2 xfs remount,user_xattr 0 0
(2)在管理节点compute上,准备OSD
cd /home/mengfei/my-cluster
ceph-deploy osd prepare compute:/var/lib/ceph/osd/ceph-osd2
ceph-deploy osd activate compute:/var/lib/ceph/osd/ceph-osd2
root@compute:/home/mengfei/my-cluster# ceph-deploy osd prepare compute:/var/lib/ceph/osd/ceph-osd2
found configuration file at: /root/.cephdeploy.conf
Invoked (1.5.20): /usr/bin/ceph-deploy osd prepare compute:/var/lib/ceph/osd/ceph-osd2
Preparing cluster ceph disks compute:/var/lib/ceph/osd/ceph-osd2:
connected to host: compute
detect platform information from remote host
detect machine type
Distro info: Ubuntu 14.04 trusty
Deploying osd to compute
write cluster configuration to /etc/ceph/{cluster}.conf
Running command: udevadm trigger --subsystem-match=block --action=add
Preparing host compute disk /var/lib/ceph/osd/ceph-osd2 journal None activate False
Running command: ceph-disk -v prepare --fs-type xfs --cluster ceph -- /var/lib/ceph/osd/ceph-osd2
INFO:ceph-disk:Running command: /usr/bin/ceph-osd --cluster=ceph --show-config-value=fsid
INFO:ceph-disk:Running command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup osd_mkfs_options_xfs
INFO:ceph-disk:Running command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup osd_fs_mkfs_options_xfs
INFO:ceph-disk:Running command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup osd_mount_options_xfs
INFO:ceph-disk:Running command: /usr/bin/ceph-conf --cluster=ceph --name=osd. --lookup osd_fs_mount_options_xfs
INFO:ceph-disk:Running command: /usr/bin/ceph-osd --cluster=ceph --show-config-value=osd_journal_size
DEBUG:ceph-disk:Preparing osd data dir /var/lib/ceph/osd/ceph-osd2
checking OSD status...
Running command: ceph --cluster=ceph osd stat --format=json
Host compute is now ready for osd use.
root@compute:/home/mengfei/my-cluster#
root@compute:/home/mengfei/my-cluster# ceph-deploy osd activate compute:/var/lib/ceph/osd/ceph-osd2
found configuration file at: /root/.cephdeploy.conf
Invoked (1.5.20): /usr/bin/ceph-deploy osd activate compute:/var/lib/ceph/osd/ceph-osd2
Activating cluster ceph disks compute:/var/lib/ceph/osd/ceph-osd2:
connected to host: compute
detect platform information from remote host
detect machine type
Distro info: Ubuntu 14.04 trusty
activating host compute disk /var/lib/ceph/osd/ceph-osd2
will use init type: upstart
Running command: ceph-disk -v activate --mark-init upstart --mount /var/lib/ceph/osd/ceph-osd2
DEBUG:ceph-disk:Cluster uuid is 8b2af1e6-92eb-4d74-9ca5-057522bb738f
INFO:ceph-disk:Running command: /usr/bin/ceph-osd --cluster=ceph --show-config-value=fsid
DEBUG:ceph-disk:Cluster name is ceph
DEBUG:ceph-disk:OSD uuid is 032998d3-03b5-458d-b32b-de48305e5b59
DEBUG:ceph-disk:Allocating OSD id...
INFO:ceph-disk:Running command: /usr/bin/ceph --cluster ceph --name client.bootstrap-osd --keyring /var/lib/ceph/bootstrap-osd/ceph.keyring osd create --concise 032998d3-03b5-458d-b32b-de48305e5b59
DEBUG:ceph-disk:OSD id is 2
DEBUG:ceph-disk:Initializing OSD...
INFO:ceph-disk:Running command: /usr/bin/ceph --cluster ceph --name client.bootstrap-osd --keyring /var/lib/ceph/bootstrap-osd/ceph.keyring mon getmap -o /var/lib/ceph/osd/ceph-osd2/activate.monmap
got monmap epoch 1
INFO:ceph-disk:Running command: /usr/bin/ceph-osd --cluster ceph --mkfs --mkkey -i 2 --monmap /var/lib/ceph/osd/ceph-osd2/activate.monmap --osd-data /var/lib/ceph/osd/ceph-osd2 --osd-journal /var/lib/ceph/osd/ceph-osd2/journal --osd-uuid 032998d3-03b5-458d-b32b-de48305e5b59 --keyring /var/lib/ceph/osd/ceph-osd2/keyring
2014-11-28 14:32:34.800238 b6822740 -1 journal FileJournal::_open: disabling aio for non-block journal.Use journal_force_aio to force use of aio anyway
2014-11-28 14:32:35.280160 b6822740 -1 journal FileJournal::_open: disabling aio for non-block journal.Use journal_force_aio to force use of aio anyway
2014-11-28 14:32:35.304026 b6822740 -1 filestore(/var/lib/ceph/osd/ceph-osd2) could not find 23c2fcde/osd_superblock/0//-1 in index: (2) No such file or directory
2014-11-28 14:32:35.370476 b6822740 -1 created object store /var/lib/ceph/osd/ceph-osd2 journal /var/lib/ceph/osd/ceph-osd2/journal for osd.2 fsid 8b2af1e6-92eb-4d74-9ca5-057522bb738f
2014-11-28 14:32:35.370543 b6822740 -1 auth: error reading file: /var/lib/ceph/osd/ceph-osd2/keyring: can't open /var/lib/ceph/osd/ceph-osd2/keyring: (2) No such file or directory
2014-11-28 14:32:35.370712 b6822740 -1 created new key in keyring /var/lib/ceph/osd/ceph-osd2/keyring
DEBUG:ceph-disk:Marking with init system upstart
DEBUG:ceph-disk:Authorizing OSD key...
INFO:ceph-disk:Running command: /usr/bin/ceph --cluster ceph --name client.bootstrap-osd --keyring /var/lib/ceph/bootstrap-osd/ceph.keyring auth add osd.2 -i /var/lib/ceph/osd/ceph-osd2/keyring osd allow * mon allow profile osd
added key for osd.2
DEBUG:ceph-disk:ceph osd.2 data dir is ready at /var/lib/ceph/osd/ceph-osd2
DEBUG:ceph-disk:Creating symlink /var/lib/ceph/osd/ceph-2 -> /var/lib/ceph/osd/ceph-osd2
DEBUG:ceph-disk:Starting ceph osd.2...
INFO:ceph-disk:Running command: /sbin/initctl emit --no-wait -- ceph-osd cluster=ceph id=2
checking OSD status...
Running command: ceph --cluster=ceph osd stat --format=json
root@compute:/home/mengfei/my-cluster#
(3)增加OSD节点后,查看集群重新平衡状态
ceph osd tree
ceph -w
ceph -s
ceph osd dump
root@compute:/home/mengfei/my-cluster# ceph osd tree (weight默认是0)
# id weighttype name up/down reweight
-1 0 root default
-2 0 host controller
0 0 osd.0 up 1
-3 0 host network
1 0 osd.1 up 1
-4 0 host compute
2 0 osd.2 up 1
root@compute:/home/mengfei/my-cluster#
[root@compute:/home/mengfei/my-cluster# ceph -w (由于没修改weight权重值,所以下边状态是192 creating+incomplete)
cluster 8b2af1e6-92eb-4d74-9ca5-057522bb738f
health HEALTH_WARN 192 pgs incomplete; 192 pgs stuck inactive; 192 pgs stuck unclean; 50 requests are blocked > 32 sec
monmap e3: 3 mons at {compute=192.168.128.101:6789/0,controller=192.168.128.100:6789/0,network=192.168.128.102:6789/0}, election epoch 6, quorum 0,1,2 controller,compute,network
mdsmap e5: 1/1/1 up {0=compute=up:creating}
osdmap e23: 3 osds: 3 up, 3 in
pgmap v50: 192 pgs, 3 pools, 0 bytes data, 0 objects
398 MB used, 2656 MB / 3054 MB avail
192 creating+incomplete
root@compute:/home/mengfei/my-cluster#
root@compute:/home/mengfei/my-cluster# ceph osd dump
epoch 23
fsid 8b2af1e6-92eb-4d74-9ca5-057522bb738f
created 2014-11-27 16:22:54.085639
modified 2014-11-28 16:30:06.501906
flags
pool 0 'data' replicated size 3 min_size 2 crush_ruleset 0 object_hash rjenkins pg_num 64 pgp_num 64 last_change 1 flags hashpspool crash_replay_interval 45 stripe_width 0
pool 1 'metadata' replicated size 3 min_size 2 crush_ruleset 0 object_hash rjenkins pg_num 64 pgp_num 64 last_change 1 flags hashpspool stripe_width 0
pool 2 'rbd' replicated size 3 min_size 2 crush_ruleset 0 object_hash rjenkins pg_num 64 pgp_num 64 last_change 1 flags hashpspool stripe_width 0
max_osd 3
osd.0 up inweight 0 up_from 15 up_thru 15 down_at 12 last_clean_interval [4,11) 192.168.128.100:6800/3272 192.168.128.100:6801/3272 192.168.128.100:6802/3272 192.168.128.100:6803/3272 exists,up f4707c04-aeca-46fe-bf0e-f7e2d43d0524
osd.1 up inweight 0 up_from 14 up_thru 0 down_at 13 last_clean_interval [8,12) 192.168.128.102:6800/3272 192.168.128.102:6801/3272 192.168.128.102:6802/3272 192.168.128.102:6803/3272 exists,up c8b2811c-fb19-49c3-b630-374a4db7073e
osd.2 up inweight 0 up_from 22 up_thru 0 down_at 21 last_clean_interval [19,19) 192.168.128.101:6801/16367 192.168.128.101:6802/16367 192.168.128.101:6803/16367 192.168.128.101:6804/16367 exists,up 032998d3-03b5-458d-b32b-de48305e5b59
root@compute:/home/mengfei/my-cluster#
2. 在compute上增加元数据服务器
注:为使用CephFS文件系统,至少需要一台元数据服务器
注:当前Ceph产品仅支持一个元数据服务器,可尝试运行多个,但不受商业支持
ceph-deploy mds create compute
ceph mds stat查看状态
ceph mds dump查看状态
[root@compute:/home/mengfei/my-cluster# ceph-deploy mds create compute
found configuration file at: /root/.cephdeploy.conf
Invoked (1.5.20): /usr/bin/ceph-deploy mds create compute
Deploying mds, cluster ceph hosts compute:compute
connected to host: compute
detect platform information from remote host
detect machine type
Distro info: Ubuntu 14.04 trusty
remote host will use upstart
deploying mds bootstrap to compute
write cluster configuration to /etc/ceph/{cluster}.conf
create path if it doesn't exist
Running command: ceph --cluster ceph --name client.bootstrap-mds --keyring /var/lib/ceph/bootstrap-mds/ceph.keyring auth get-or-create mds.compute osd allow rwx mds allow mon allow profile mds -o /var/lib/ceph/mds/ceph-compute/keyring
Running command: initctl emit ceph-mds cluster=ceph id=compute
Unhandled exception in thread started by
[root@compute:/home/mengfei/my-cluster#
root@compute:/home/mengfei/my-cluster# ceph mds stat
e3: 1/1/1 up {0=compute=up:creating}
root@compute:/home/mengfei/my-cluster#
root@compute:/home/mengfei/my-cluster# ceph mds dump
dumped mdsmap epoch 3
epoch 3
flags 0
created 2014-11-27 16:22:54.081490
modified 2014-11-28 14:45:35.509558
tableserver 0
root 0
session_timeout 60
session_autoclose 300
max_file_size 1099511627776
last_failure 0
last_failure_osd_epoch0
compatcompat={},rocompat={},incompat={1=base v0.20,2=client writeable ranges,3=default file layouts on dirs,4=dir inode in separate object,5=mds uses versioned encoding,6=dirfrag is stored in omap}
max_mds 1
in 0
up {0=4306}
failed
stopped
data_pools 0
metadata_pool 1
inline_data disabled
4306: 192.168.128.101:6805/7363 'compute' mds.0.1 up:creating seq 1
root@compute:/home/mengfei/my-cluster#
删除元数据:(注:删除元数据mds时,会提示以下信息,必须降低max_mds)
root@compute:/home/mengfei# ceph mds stop 0
Error EBUSY: must decrease max_mds or else MDS will immediately reactivate
root@compute:/home/mengfei#
root@compute:/home/mengfei# ceph mds set_max_mds 0 (降低max值)
max_mds = 0
root@compute:/home/mengfei#
root@compute:/home/mengfei# ceph mds stop 0
telling mds.0 192.168.128.101:6800/26057 to deactivate
root@compute:/home/mengfei#
3. 在controller=osd0/network=osd1节点增加监视器mon1和mon2
注:Ceph使用Paxos算法,需要多个Ceph监视器组成Quoram(如1,2:3,3:4,3:5,4:6等)
ceph-deploy admin create controller network (注:重新分发以下的配置文件)
ceph-deploy mon create controller network
注:执行以上命令时,提示/var/run/ceph/ceph-mon.controller.asok not found. 主要还是ceph.conf文件不对
添加相关项再push到所有节点后就正常了。
vi /home/mengfei/my-cluster/ceph.conf (以下项也并不全面,稍后再改)
fsid = 8b2af1e6-92eb-4d74-9ca5-057522bb738f
mon_initial_members = compute,controller,network
mon_host = 192.168.128.101,192.168.128.100,192.168.128.102
public network = 192.168.128.0/24
auth_cluster_required = cephx
auth_service_required = cephx
auth_client_required = cephx
#filestore_xattr_use_omap = true
osd journal size = 100
filestore_xattr_use_omap = true
osd pool default size = 3
osd pool default min_size = 1
osd crush chooseleaf type = 1
host = controller
host = network
host = compute
host = compute
mon_addr = 192.168.128.101:6789
host = controller
mon_addr = 192.168.128.100:6789
host = network
mon_addr = 192.168.128.102:6789
host = compute
root@compute:/home/mengfei/my-cluster# ceph-deploy mon create controller network
found configuration file at: /root/.cephdeploy.conf
Invoked (1.5.20): /usr/bin/ceph-deploy mon create controller network
Deploying mon, cluster ceph hosts controller network
detecting platform for host controller ...
connected to host: controller
detect platform information from remote host
detect machine type
distro info: Ubuntu 14.04 trusty
determining if provided host has same hostname in remote
get remote short hostname
deploying mon to controller
get remote short hostname
remote hostname: controller
write cluster configuration to /etc/ceph/{cluster}.conf
create the mon path if it does not exist
checking for done path: /var/lib/ceph/mon/ceph-controller/done
create a done file to avoid re-doing the mon deployment
create the init path if it does not exist
locating the `service` executable...
Running command: initctl emit ceph-mon cluster=ceph id=controller
Running command: ceph --cluster=ceph --admin-daemon /var/run/ceph/ceph-mon.controller.asok mon_status
********************************************************************************
status for monitor: mon.controller
{
"election_epoch": 0,
"extra_probe_peers": [
"192.168.128.101:6789/0"
],
"monmap": {
"created": "0.000000",
"epoch": 1,
"fsid": "8b2af1e6-92eb-4d74-9ca5-057522bb738f",
"modified": "0.000000",
"mons": [
{
"addr": "192.168.128.101:6789/0",
"name": "compute",
"rank": 0
}
]
},
"name": "controller",
"outside_quorum": [],
"quorum": [],
"rank": -1,
"state": "probing",
"sync_provider": []
}
********************************************************************************
monitor: mon.controller is currently at the state of probing
Running command: ceph --cluster=ceph --admin-daemon /var/run/ceph/ceph-mon.controller.asok mon_status
monitor controller does not exist in monmap
detecting platform for host network ...
connected to host: network
detect platform information from remote host
detect machine type
distro info: Ubuntu 14.04 trusty
determining if provided host has same hostname in remote
get remote short hostname
deploying mon to network
get remote short hostname
remote hostname: network
write cluster configuration to /etc/ceph/{cluster}.conf
create the mon path if it does not exist
checking for done path: /var/lib/ceph/mon/ceph-network/done
create a done file to avoid re-doing the mon deployment
create the init path if it does not exist
locating the `service` executable...
Running command: initctl emit ceph-mon cluster=ceph id=network
Running command: ceph --cluster=ceph --admin-daemon /var/run/ceph/ceph-mon.network.asok mon_status
********************************************************************************
status for monitor: mon.network
{
"election_epoch": 1,
"extra_probe_peers": [
"192.168.128.101:6789/0"
],
"monmap": {
"created": "0.000000",
"epoch": 3,
"fsid": "8b2af1e6-92eb-4d74-9ca5-057522bb738f",
"modified": "2014-11-28 16:18:49.267793",
"mons": [
{
"addr": "192.168.128.100:6789/0",
"name": "controller",
"rank": 0
},
{
"addr": "192.168.128.101:6789/0",
"name": "compute",
"rank": 1
},
{
"addr": "192.168.128.102:6789/0",
"name": "network",
"rank": 2
}
]
},
"name": "network",
"outside_quorum": [],
"quorum": [],
"rank": 2,
"state": "electing",
"sync_provider": []
}
********************************************************************************
monitor: mon.network is running
Running command: ceph --cluster=ceph --admin-daemon /var/run/ceph/ceph-mon.network.asok mon_status
root@compute:/home/mengfei/my-cluster#
查看监视器quorum状态 (注:增加监视器后,ceph将同步各监视器并形成quarum)
ceph mon stat
ceph mon_status
ceph mon dump
ceph quorum_status
root@compute:/home/mengfei/my-cluster# ceph mon stat
e3: 3 mons at {compute=192.168.128.101:6789/0,controller=192.168.128.100:6789/0,network=192.168.128.102:6789/0}, election epoch 6, quorum 0,1,2 controller,compute,network
root@compute:/home/mengfei/my-cluster#
root@compute:/home/mengfei/my-cluster# ceph mon_status
{"name":"controller","rank":0,"state":"leader","election_epoch":6,"quorum":,"outside_quorum":[],"extra_probe_peers":["192.168.128.101:6789\/0","192.168.128.102:6789\/0"],"sync_provider":[],"monmap":{"epoch":3,"fsid":"8b2af1e6-92eb-4d74-9ca5-057522bb738f","modified":"2014-11-28 16:18:49.267793","created":"0.000000","mons":[{"rank":0,"name":"controller","addr":"192.168.128.100:6789\/0"},{"rank":1,"name":"compute","addr":"192.168.128.101:6789\/0"},{"rank":2,"name":"network","addr":"192.168.128.102:6789\/0"}]}}
root@compute:/home/mengfei/my-cluster#
root@compute:/home/mengfei/my-cluster# ceph mon dump
dumped monmap epoch 3
epoch 3
fsid 8b2af1e6-92eb-4d74-9ca5-057522bb738f
last_changed 2014-11-28 16:18:49.267793
created 0.000000
0: 192.168.128.100:6789/0 mon.controller
1: 192.168.128.101:6789/0 mon.compute
2: 192.168.128.102:6789/0 mon.network
root@compute:/home/mengfei/my-cluster#
root@compute:/home/mengfei/my-cluster# ceph quorum_status
{"election_epoch":6,"quorum":,"quorum_names":["controller","compute","network"],"quorum_leader_name":"controller","monmap":{"epoch":3,"fsid":"8b2af1e6-92eb-4d74-9ca5-057522bb738f","modified":"2014-11-28 16:18:49.267793","created":"0.000000","mons":[{"rank":0,"name":"controller","addr":"192.168.128.100:6789\/0"},{"rank":1,"name":"compute","addr":"192.168.128.101:6789\/0"},{"rank":2,"name":"network","addr":"192.168.128.102:6789\/0"}]}}
root@compute:/home/mengfei/my-cluster#
(四)验证集群osd,检查集群健康状况
ceph health 查看健康状态
ceph auth list查看认证状态
ceph osd tree 查看状态
ceph -s 查看状态
ceph -w 查看实时状态(和-s内容一样)
ceph osd dump 查看osd配置信息
ceph osd rm 删除节点 remove osd(s) <id> [<id>...]
ceph osd crush rm osd.0 在集群中删除一个osd 硬盘 crush map
ceph osd crush rm node1 在集群中删除一个osd的host节点
以下是修改object副本个数及最小个数命令(也可以在ceph.conf中指定):
ceph osd pool set data size 3
ceph osd pool set metadata size 3
ceph osd pool set rbd size 3
ceph osd pool set data min_size 1
ceph osd pool set metadata min_size 1
ceph osd pool set rbd min_size 1
以下是修改允许的最大时钟差(默认情况下,实例中报ceph -w会报:clock skew detected on mon.compute,ceph health detail可看详细,修改此值为0.5,就会health oK)
mon_clock_drift_allowed = 0.5
以下是修改weight权重值命令:
ceph osd crush set 0 1.0 host=controller
ceph osd crush set 1 1.0 host=network
ceph osd crush set 2 1.0 host=compute
注:你将会看到PG状态由活跃且干净状态变成活跃态,其中存在部分降级对象。当迁移完成后,
将再次返回活跃且干净状态。(可按Control+c组合键退出)
root@compute:/var/log/ceph# ceph osd crush set 0 1.0 host=controller
set item id 0 name 'osd.0' weight 1 at location {host=controller} to crush map
root@compute:/var/log/ceph# ceph osd crush set 1 1.0 host=network
set item id 1 name 'osd.1' weight 1 at location {host=network} to crush map
root@compute:/var/log/ceph# ceph osd crush set 2 1.0 host=compute
set item id 2 name 'osd.2' weight 1 at location {host=compute} to crush map
root@compute:/var/log/ceph#
root@compute:/home/mengfei/my-cluster# ceph osd tree (weight默认是0)
# id weighttype name up/down reweight
-1 0 root default
-2 0 host controller
0 0 osd.0 up 1
-3 0 host network
1 0 osd.1 up 1
-4 0 host compute
2 0 osd.2 up 1
root@compute:/home/mengfei/my-cluster#
root@compute:/home/mengfei/my-cluster# ceph -s (inactive+unclean状态,实例中修改weight值为1之后<默认是0>,就正常了)
cluster 8b2af1e6-92eb-4d74-9ca5-057522bb738f
health HEALTH_WARN 192 pgs incomplete; 192 pgs stuck inactive; 192 pgs stuck unclean
monmap e1: 1 mons at {compute=192.168.128.101:6789/0}, election epoch 1, quorum 0 compute
osdmap e16: 2 osds: 2 up, 2 in
pgmap v31: 192 pgs, 3 pools, 0 bytes data, 0 objects
266 MB used, 1770 MB / 2036 MB avail
192 creating+incomplete
root@compute:/home/mengfei/my-cluster#
root@compute:/home/mengfei/my-cluster# ceph osd tree (weight值修改为1)
# id weighttype name up/down reweight
-1 3 root default
-2 1 host controller
0 1 osd.0 up 1
-3 1 host network
1 1 osd.1 up 1
-4 1 host compute
2 1 osd.2 up 1
root@compute:/home/mengfei/my-cluster#
root@compute:/home/mengfei/my-cluster# ceph -s (以下是修改weight=1后的输出)
cluster 8b2af1e6-92eb-4d74-9ca5-057522bb738f
health HEALTH_WARN clock skew detected on mon.compute, mon.network (这是时钟偏移报警,应该没事,修改ceph.conf mon_clock_drift_allowed = 0.5解决)
monmap e3: 3 mons at {compute=192.168.128.101:6789/0,controller=192.168.128.100:6789/0,network=192.168.128.102:6789/0}, election epoch 30, quorum 0,1,2 controller,compute,network
mdsmap e14: 1/1/1 up {0=compute=up:active}
osdmap e89: 3 osds: 3 up, 3 in
pgmap v351: 192 pgs, 3 pools, 1884 bytes data, 20 objects
406 MB used, 2648 MB / 3054 MB avail
192 active+clean
root@compute:/home/mengfei/my-cluster#
root@compute:/var/log/ceph# ceph osd dump
epoch 89
fsid 8b2af1e6-92eb-4d74-9ca5-057522bb738f
created 2014-11-27 16:22:54.085639
modified 2014-11-28 23:39:44.056533
flags
pool 0 'data' replicated size 3 min_size 1 crush_ruleset 0 object_hash rjenkins pg_num 64 pgp_num 64 last_change 89 flags hashpspool crash_replay_interval 45 stripe_width 0
pool 1 'metadata' replicated size 3 min_size 1 crush_ruleset 0 object_hash rjenkins pg_num 64 pgp_num 64 last_change 88 flags hashpspool stripe_width 0
pool 2 'rbd' replicated size 3 min_size 1 crush_ruleset 0 object_hash rjenkins pg_num 64 pgp_num 64 last_change 87 flags hashpspool stripe_width 0
max_osd 3
osd.0 up inweight 1 up_from 32 up_thru 82 down_at 31 last_clean_interval [15,29) 192.168.128.100:6800/2811 192.168.128.100:6801/2811 192.168.128.100:6802/2811 192.168.128.100:6803/2811 exists,up f4707c04-aeca-46fe-bf0e-f7e2d43d0524
osd.1 up inweight 1 up_from 33 up_thru 82 down_at 29 last_clean_interval [14,28) 192.168.128.102:6800/3105 192.168.128.102:6801/3105 192.168.128.102:6802/3105 192.168.128.102:6803/3105 exists,up c8b2811c-fb19-49c3-b630-374a4db7073e
osd.2 up inweight 1 up_from 35 up_thru 82 down_at 30 last_clean_interval [27,29) 192.168.128.101:6801/3173 192.168.128.101:6802/3173 192.168.128.101:6803/3173 192.168.128.101:6804/3173 exists,up 032998d3-03b5-458d-b32b-de48305e5b59
root@compute:/var/log/ceph#
(五)存储/恢复对象数据
注:为了能够操作Ceph存储集群中的对象数据,Ceph客户端必需满足:
1. 设置一个对象名
2. 指定一个数据池
Ceph客户端取回最新的集群映射表,并根据CRUSH算法先计算如何将对象映射到某个PG中,
然后再计算如何将该PG动态映射入一个Ceph OSD进程上。为了查找对象位置,你需要的仅仅是对象名称和数据池名称
ceph osd map {poolname} {object-name}
1.练习:定位一个对象
作为一个练习,我们先创建一个对象。使用rados put命令指定对象名称、存储对象数据的测试文件路径和地址池名称。例如:
格式:rados put {object-name} {file-path} --pool=data
rados put zhi-ceph zhi.txt --pool=data
2. 为了验证Ceph存储集群已存储该对象,执行如下命令
rados -p data ls
rados -p metadta ls
root@compute:/home/mengfei/my-cluster# rados -p data ls
zhi-ceph
root@compute:/home/mengfei/my-cluster#
root@compute:/home/mengfei/my-cluster# rados -p metadata ls
609.00000000
mds0_sessionmap
608.00000000
601.00000000
602.00000000
mds0_inotable
1.00000000.inode
200.00000000
604.00000000
605.00000000
mds_anchortable
mds_snaptable
600.00000000
603.00000000
100.00000000
200.00000001
606.00000000
607.00000000
100.00000000.inode
1.00000000
root@compute:/home/mengfei/my-cluster#
3. 现在,可标识对象位置
格式:ceph osd map {pool-name} {object-name}
ceph osd map data zhi-ceph
ceph osd map metadata zhi-ceph
root@compute:/home/mengfei/my-cluster# ceph osd map data zhi-ceph
osdmap e89 pool 'data' (0) object 'zhi-ceph' -> pg 0.e67b1a3 (0.23) -> up (, p1) acting (, p1)
root@compute:/home/mengfei/my-cluster#
root@compute:/home/mengfei/my-cluster# ceph osd map metadata zhi-ceph
osdmap e89 pool 'metadata' (1) object 'zhi-ceph' -> pg 1.e67b1a3 (1.23) -> up (, p0) acting (, p0)
root@compute:/home/mengfei/my-cluster#
Ceph将输出对象位置信息,例如:
osdmap e537 pool 'data' (0) object 'test-object-1' -> pg 0.d1743484 (0.4) -> up acting
4. 删除测试对象,使用rados rm命令
rados rm zhi-ceph --pool=data
注:当集群扩展后,对象位置可能会动态变更。Ceph动态平衡的一个好处就是Ceph可自动完成迁移而无须你手动操作
5.创建一个池
ceph osd pool create zhi-pool 128
ceph osd pool set zhi-pool min_size 1
ceph -w 查看实时的迁移状态
(六)块设备快速启动
注:要使用这个指南,你必须首先在对象存储快速启动引导中执行程序。在使用Ceph的块设备工作前,确保您的Ceph的存储集群是在主动 + 清洁状态。在admin节点上执行这个快速启动。
注意:Ceph的块设备也被称为RBD或RADOS块设备
1. 安装 Ceph
(1) 检查linux的内核版本
lsb_release -a
uname -r
(2)在管理节点上,用ceph-deploy安装Ceph在你的ceph-client节点 (注:前边已经安装过,这里不再执行)
ceph-deploy install network(实例中将network作为client端)
(3) 在管理节点上,用ceph-deploy复制Ceph配置文件和ceph.client.admin.keyring到你的ceph-client
ceph-deploy admin network
2. 配置一个块设备
(1)在ceph-client节点上,创建一个块设备的镜像
rbd create foo --size 4096 [-m {mon-IP}] [-k /path/to/ceph.client.admin.keyring]
ceph osd pool create rbd-pool 128
ceph osd pool set rbd-pool min_size 1
rbd create foo --size 512
rbd create bar --size 256 --pool rbd-pool
rbd create zhi --size 512 --pool rbd-pool
ceph osd pool delete rbd-pool rbd-pool --yes-i-really-really-mean-it
注:删除一个pool,要指定两次poolname 并加上后边的参数
验证查询块设备信息:
rbd ls 查看块设备镜像
rbd ls rbd-pool 列出块设备在一个特定的池
rbd --image foo info 从一个特定的镜像查询信息
rbd --image bar -p rbd-pool info 查询一个池内的镜像信息
root@network:/home/mengfei/my-cluster# rbd ls
foo
root@network:/home/mengfei/my-cluster#
root@network:/home/mengfei/my-cluster# rbd ls rbd-pool
bar
root@network:/home/mengfei/my-cluster#
root@network:/home/mengfei# rbd showmapped
id pool image snap device
1rbd foo - /dev/rbd1
2rbd-pool zhi - /dev/rbd2
3rbd-pool bar - /dev/rbd3
root@network:/home/mengfei#
root@network:/home/mengfei/my-cluster# rbd --image foo info
rbd image 'foo':
size 512 MB in 128 objects
order 22 (4096 kB objects)
block_name_prefix: rb.0.16cb.2ae8944a
format: 1
root@network:/home/mengfei/my-cluster#
root@network:/home/mengfei/my-cluster# rbd --image bar -p rbd-pool info
rbd image 'bar':
size 512 MB in 128 objects
order 22 (4096 kB objects)
block_name_prefix: rb.0.16b8.2ae8944a
format: 1
root@network:/home/mengfei/my-cluster#
调整块设备镜像
Ceph的块设备镜像精简置备。他们实际上不使用任何物理存储,直到你开始保存数据。
然而,他们有一个最大容量-大小选项设置。如果你想增加(或减少)一个的CEPH座设备镜像的最大尺寸,执行以下命令:
rbd resize --image foo --size 1024 增加size
rbd resize --image foo --allow-shrink --size 512减小size
rbd resize --image zhi -p rbd-pool --size 1024
rbd resize --image zhi -p rbd-pool --allow-shrink --size 256
删除块设备镜像
rbd rm foo 要删除一个块设备
rbd rm bar -p rbd-pool 从池中删除一个块设备
(2)在ceph-client节点上,加载rbd客户端模块
modprobe rbd
(3)在ceph-client节点上,映射这个镜像到一个块设备
rbd map foo --pool rbd --name client.admin [-m {mon-IP}] [-k /path/to/ceph.client.admin.keyring]
rbd map foo
rbd map bar --pool rbd-pool
rbd map zhi --pool rbd-pool(实例中以zhi为例来操作)
rbd showmapped 显示映射块设备
取消块设备映射
格式:rbd unmap /dev/rbd/{poolname}/{imagename}
rbd unmap /dev/rbd/rbd/foo
(4)用这个块设备在一个ceph-client节点network上创建一个文件系统
mkfs.ext4 -m0 /dev/rbd/rbd-pool/zhi (这个可能要花费几分钟)
root@network:/home/mengfei/my-cluster# rbd map zhi --pool rbd-pool
root@network:/home/mengfei/my-cluster# mkfs.ext4 -m0 /dev/rbd/rbd-pool/zhi
mke2fs 1.42.9 (4-Feb-2014)
Filesystem label=
OS type: Linux
Block size=4096 (log=2)
Fragment size=4096 (log=2)
Stride=1024 blocks, Stripe width=1024 blocks
32768 inodes, 131072 blocks
0 blocks (0.00%) reserved for the super user
First data block=0
Maximum filesystem blocks=134217728
4 block groups
32768 blocks per group, 32768 fragments per group
8192 inodes per group
Superblock backups stored on blocks:
32768, 98304
Allocating group tables: done
Writing inode tables: done
Creating journal (4096 blocks): done
Writing superblocks and filesystem accounting information: done
root@network:/home/mengfei/my-cluster#
(5)挂载这个文件系统到你的ceph-client节点上
mkdir /mnt/ceph-block-device
mount /dev/rbd/rbd-pool/zhi /mnt/ceph-block-device
cd /mnt/ceph-block-device
修改自动加载:
默认情况下,创建块设备后,有个/etc/init.d/rbdmap文件
root@network:/home/mengfei# vi /etc/ceph/rbdmap (不知道为何,在管理节点compute有这个文件,client节点network上没有,暂从compute拷过来)
# RbdDevice Parameters
#poolname/imagename id=client,keyring=/etc/ceph/ceph.client.keyring
rbd-pool/zhi
#rbd-pool/bar
#rbd/foo
注:因为如果禁用了cephx,所以不必配置keyring了。
这样就可以手动控制、并且开关机可以自动挂载和卸载rbd块设备了
配置rbdmap (实例中/etc/init.d/rbdmap会自动生成)
创建rbd块设备并rbd map后,如果不及时rbd unmap,关机的时候系统会hung在umount此rbd设备上。
所以配置rbdmap是必须的。首先下载并设置开机启动rbdmap
$ sudo wget https://raw.github.com/ceph/ceph/a4ddf704868832e119d7949e96fe35ab1920f06a/src/init-rbdmap -O /etc/init.d/rbdmap
$ sudo chmod +x /etc/init.d/rbdmap
$ sudo update-rc.d rbdmap defaults
root@network:/home/mengfei# update-rc.d rbdmap defaults
Adding system startup for /etc/init.d/rbdmap ...
/etc/rc0.d/K20rbdmap -> ../init.d/rbdmap
/etc/rc1.d/K20rbdmap -> ../init.d/rbdmap
/etc/rc6.d/K20rbdmap -> ../init.d/rbdmap
/etc/rc2.d/S20rbdmap -> ../init.d/rbdmap
/etc/rc3.d/S20rbdmap -> ../init.d/rbdmap
/etc/rc4.d/S20rbdmap -> ../init.d/rbdmap
/etc/rc5.d/S20rbdmap -> ../init.d/rbdmap
root@network:/home/mengfei#
(6)验证rbd的信息
ceph -w
rados -p rbd-pool ls
ceph osd map rbd-pool rbd
root@network:/home/mengfei/my-cluster# rados -p rbd-pool ls
rb.0.16df.2ae8944a.000000000041
rb.0.16df.2ae8944a.000000000042
rbd_directory
rb.0.16df.2ae8944a.000000000060
rb.0.16df.2ae8944a.000000000001
rb.0.16df.2ae8944a.000000000020
bar.rbd
zhi.rbd
rb.0.16df.2ae8944a.000000000002
rb.0.16df.2ae8944a.000000000040
rb.0.16df.2ae8944a.000000000043
rb.0.16df.2ae8944a.00000000007f
rb.0.16df.2ae8944a.000000000000
root@network:/home/mengfei/my-cluster#
root@network:/home/mengfei/my-cluster# ceph osd map rbd-pool rbd
osdmap e139 pool 'rbd-pool' (4) object 'rbd' -> pg 4.7a31dfd8 (4.58) -> up (, p1) acting (, p1)
root@network:/home/mengfei/my-cluster#
(七)Ceph的文件系统快速入门
(1)先决条件
安装 ceph-common.
apt-get install ceph-common
注:确保Ceph的存储集群正在运行,并且在活跃 + 清洁状态。此外,确保你至少有一个Ceph的元数据服务器运行
ceph -s
root@compute:/var/lib/ceph/osd/ceph-osd2/current# ceph -s
cluster 8b2af1e6-92eb-4d74-9ca5-057522bb738f
health HEALTH_OK
monmap e3: 3 mons at {compute=192.168.128.101:6789/0,controller=192.168.128.100:6789/0,network=192.168.128.102:6789/0}, election epoch 72, quorum 0,1,2 controller,compute,network
mdsmap e34: 1/1/1 up {0=compute=up:active}
osdmap e139: 3 osds: 3 up, 3 in
pgmap v758: 448 pgs, 5 pools, 60758 kB data, 43 objects
584 MB used, 2470 MB / 3054 MB avail
448 active+clean
root@compute:/var/lib/ceph/osd/ceph-osd2/current#
(2)创建一个文件系统
ceph osd pool create cephfs_data 64 (注:用ceph osd dump可以看到新生成的pool名及int号,生成此pool id是5)
ceph osd pool create cephfs_metadata 64(生成此pool的id为6)
#ceph fs newfs mycephfs cephfs_metadata cephfs_data 此命令不对,用以下命令
ceph osd dump
创建新的mds fs pool:
格式:ceph mds newfs <int> <int> {--yes-i-really-mean-it} :make new filesystom using pools <metadata> and <data>
ceph mds dump
ceph mds newfs 6 5 --yes-i-really-mean-it
root@compute:/home/mengfei/my-cluster# ceph mds newfs 6 5 --yes-i-really-mean-it
new fs with metadata pool 6 and data pool 5
root@compute:/home/mengfei/my-cluster#
ceph mds add_data_pool cephfs_data (如果要添加pool,可以用此命令,如添加的是默认pool:data会无法删除,只能上边命令新建)
root@compute:/home/mengfei/my-cluster# ceph mds dump (注:默认情况下data_pools和metadata_pool是0和1)
dumped mdsmap epoch 201
epoch 201
flags 0
created 2014-12-04 15:09:55.788256
modified 2014-12-04 15:09:57.898040
tableserver 0
root 0
session_timeout 60
session_autoclose 300
max_file_size 1099511627776
last_failure 0
last_failure_osd_epoch0
compatcompat={},rocompat={},incompat={1=base v0.20,2=client writeable ranges,3=default file layouts on dirs,4=dir inode in separate object,5=mds uses versioned encoding,6=dirfrag is stored in omap}
max_mds 1
in 0
up {0=7938}
failed
stopped
data_pools 0
metadata_pool 1
inline_data disabled
7938: 192.168.128.101:6809/29475 'compute' mds.0.34 up:active seq 2
7883: 192.168.128.100:6807/21690 'controller' mds.-1.0 up:standby seq 1500
root@compute:/home/mengfei/my-cluster#
root@compute:/home/mengfei/my-cluster# ceph mds newfs 6 5 --yes-i-really-mean-it
new fs with metadata pool 6 and data pool 5
root@compute:/home/mengfei/my-cluster#
root@compute:/home/mengfei/my-cluster# ceph mds dump
dumped mdsmap epoch 206
epoch 206
flags 0
created 2014-12-04 15:15:10.430339
modified 2014-12-04 15:15:13.558941
tableserver 0
root 0
session_timeout 60
session_autoclose 300
max_file_size 1099511627776
last_failure 0
last_failure_osd_epoch0
compatcompat={},rocompat={},incompat={1=base v0.20,2=client writeable ranges,3=default file layouts on dirs,4=dir inode in separate object,5=mds uses versioned encoding,6=dirfrag is stored in omap}
max_mds 1
in 0
up {0=7953}
failed
stopped
data_pools 5
metadata_pool 6
inline_data disabled
7953: 192.168.128.101:6810/29475 'compute' mds.0.35 up:active seq 2
7883: 192.168.128.100:6807/21690 'controller' mds.-1.0 up:standby seq 1579
root@compute:/home/mengfei/my-cluster#
(3)创建一个秘钥文件
Ceph的存储集群运行的身份验证默认打开的。你应该有一个文件,其中包含密钥。为了获得针对特定用户的密钥,请执行下列步骤:
<1> 识别keyring文件内的一个用户。例如:
cat ceph.client.admin.keyring
root@compute:/home/mengfei/my-cluster# cat ceph.client.admin.keyring
key = AQBe33ZUQBvWFBAApPAN9YAiqSFJQrTXv/TM1A==
root@compute:/home/mengfei/my-cluster#
<2> 用户将使用安装Ceph的FS文件系统复制。操作步骤如下:
key = AQCj2YpRiAe6CxAA7/ETt7Hcl9IyxyYciVs47w==
<3> 打开一个文本编辑器
<4> 粘贴秘钥到一个空文件。操作步骤如下:
AQCj2YpRiAe6CxAA7/ETt7Hcl9IyxyYciVs47w==
<5> 将文件保存的用户名作为一个属性(例如, /etc/ceph/admin.secret).
<6> Ensure the file permissions are appropriate for the user, but notvisible to other users.
(4)内核驱动
mkdir /mnt/mycephfs
格式:mount -t ceph {ip-address-of-monitor}:6789:/ /mnt/mycephfs
mount -t ceph {ip-address-of-monitor}:6789:/ /mnt/mycephfs
Ceph的存储集群,默认情况下,使用验证。指定一个用户的名称和secretfile中创建创建一个秘密的文件部分。例如:
mount -t ceph 192.168.128.101:6789:/ /mnt/mycephfs -o name=admin,secretfile=/etc/ceph/admin.secret
注意:在client节点上挂载Ceph FS文件系统,而不是服务器monitor节点。
在客户端client上执行:
vi /etc/fstab添加开机自动mount
格式:{ipaddress}:{port}:/ /{mountpoint} {filesystem-name} ,[{mount.options}]
192.168.128.101:6789:/ /mnt/mycephfs ceph name=admin,secretfile=/etc/ceph/admin.secret,noatime 0 2
(5)Ceph的文件系统(FUSE)(注:此步没操作成功,提示ceph-fuse:command not found)
载Ceph FS作为一个在用户空间文件系统(FUSE)
mkdir ~/mycephfs
格式:ceph-fuse -m {ip-address-of-monitor}:6789 ~/mycephfs
Ceph的存储集群在默认情况下使用验证。如果它不是在默认位置,指定一个密钥(即:/etc/ceph):
ceph-fuse -k ./ceph.client.admin.keyring -m 192.168.128.101:6789 ~/mycephfs 发重复了,故删掉一条
页:
[1]