CEPH - CRUSHTOOL (Ver. 10.2.11_jewel, OS: CentOS 7): a failure point (bug?) when applying a modified CRUSH map
* In rare cases, a CRUSH map that has been edited and re-applied this way is not picked up correctly. The map was originally applied by following the crushtool guide (URL=crushtool_링크), but a problem occurred, and the result is shown below.
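* For reference, the map was being managed with the usual crushtool round trip, roughly as below (a sketch; the file paths are just examples):

ceph osd getcrushmap -o /tmp/crushmap             # dump the binary CRUSH map from the cluster
crushtool -d /tmp/crushmap -o /tmp/crushmap.txt   # decompile it into editable text
vi /tmp/crushmap.txt                              # edit buckets / rules
crushtool -c /tmp/crushmap.txt -o /tmp/crushmap.new   # recompile
ceph osd setcrushmap -i /tmp/crushmap.new         # inject the new map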
[ Status check ]

[root@MGMT11:25:40:~]# ceph osd tree
ID WEIGHT   TYPE NAME          UP/DOWN REWEIGHT PRIMARY-AFFINITY
-2  0.90599 root ssd
-7  0.45399     ssd_osd OSD-20
 4  0.45399         osd.4           up  1.00000          1.00000
-8  0.45200     ssd_osd OSD-21
 5  0.45200         osd.5           up  1.00000          1.00000
-1 58.15997 root hdd
-3 14.53999     hdd_osd OSD-0
 0 14.53999         osd.0           up  1.00000          1.00000
-4 14.53999     hdd_osd OSD-1
 1 14.53999         osd.1           up  1.00000          1.00000
-5 14.53999     hdd_osd OSD-2
 2 14.53999         osd.2           up  1.00000          1.00000
-6 14.53999     hdd_osd OSD-3
 3 14.53999         osd.3           up  1.00000          1.00000

* Hmm.. looks fine..?? Nothing seemed wrong on the surface, so I checked the cluster status..

[root@MGMT11:25:48:~]# ceph -s
    cluster 427f2e6a-5722-4365-a475-8fcdc218a418
     health HEALTH_WARN
            128 pgs stuck unclean
     monmap e2: 4 mons at {MON-0=192.168.1.13:6789/0,MON-1=192.168.1.14:6789/0,MON-2=192.168.1.15:6789/0,MON-3=192.168.1.16:6789/0}
            election epoch 6, quorum 0,1,2,3 MON-0,MON-1,MON-2,MON-3
     osdmap e79: 6 osds: 6 up, 6 in; 128 remapped pgs
            flags sortbitwise,require_jewel_osds
      pgmap v249: 256 pgs, 2 pools, 0 bytes data, 0 objects
            659 MB used, 60483 GB / 60484 GB avail
                 128 active+clean
                 128 active+remapped

[root@MGMT11:26:18:~]# ceph health detail
HEALTH_WARN 128 pgs stuck unclean
pg 1.7e is stuck unclean for 701.594548, current state active+remapped, last acting [2,1]
pg 1.7f is stuck unclean for 699.224062, current state active+remapped, last acting [0,2]
pg 1.7c is stuck unclean for 699.223706, current state active+remapped, last acting [0,3]
pg 1.7d is stuck unclean for 699.273517, current state active+remapped, last acting [1,2]
pg 1.7a is stuck unclean for 701.337639, current state active+remapped, last acting [3,2]
.
.
.

* ..............
* That threw me.. Since it looked like the OSDs and the bucket rules were not matching up properly, I went for everyone's favourite fix and restarted the OSDs. The PGs (and PGPs) started re-shuffling and for a moment seemed to be coming back... and then:

[root@MGMT11:41:47:~]# ceph -s
    cluster 427f2e6a-5722-4365-a475-8fcdc218a418
     health HEALTH_WARN
            255 pgs degraded
            228 pgs stale
            156 pgs stuck unclean
            255 pgs undersized
     monmap e2: 4 mons at {MON-0=192.168.1.13:6789/0,MON-1=192.168.1.14:6789/0,MON-2=192.168.1.15:6789/0,MON-3=192.168.1.16:6789/0}
            election epoch 6, quorum 0,1,2,3 MON-0,MON-1,MON-2,MON-3
     osdmap e116: 6 osds: 6 up, 6 in; 28 remapped pgs
            flags sortbitwise,require_jewel_osds
      pgmap v339: 256 pgs, 2 pools, 0 bytes data, 0 objects
            664 MB used, 60483 GB / 60484 GB avail
                 128 stale+active+undersized+degraded
                 100 stale+active+undersized+degraded+remapped
                  27 active+undersized+degraded+remapped
                   1 active+remapped

[root@MGMT11:42:04:~]# ceph osd tree
ID  WEIGHT   TYPE NAME          UP/DOWN REWEIGHT PRIMARY-AFFINITY
 -9        0 root ssd
 -6        0     ssd_osd OSD-20
 -7        0     ssd_osd OSD-21
 -8        0 root hdd
 -2        0     hdd_osd OSD-0
 -3        0     hdd_osd OSD-1
 -4        0     hdd_osd OSD-2
 -5        0     hdd_osd OSD-3
 -1 59.06596 root default
  5  0.45200     osd.5              up  1.00000          1.00000
  4  0.45399     osd.4              up  1.00000          1.00000
  3 14.53999     osd.3              up  1.00000          1.00000
  2 14.53999     osd.2              up  1.00000          1.00000
  1 14.53999     osd.1              up  1.00000          1.00000
  0 14.53999     osd.0              up  1.00000          1.00000

* Sheer panic... The OSD buckets themselves were all there, grouped by type.. but the part that actually matters, the osd devices, had all ended up in the default bucket that I had deleted... Wait.. why..??
* Checking the crushmap confirmed it: the osds had been inserted back as items under the default bucket... Why? I had removed it....
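* Before digging into the map itself, a quick note on the PG states shown earlier: active+remapped means the up set that CRUSH now computes for a PG no longer matches the acting set that is currently serving it, which is exactly what happens once the item entries go missing from the buckets. To see it per PG (a sketch, using a PG id from the health detail output above):

ceph pg map 1.7e              # prints the osdmap epoch plus the up set and acting set for that PG
ceph pg dump_stuck unclean    # lists every PG that has been stuck in a non-clean state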
[ Checking the crushmap ]

[root@MGMT11:42:07:~]# ceph osd getcrushmap -o /tmp/crushmap
got crush map from osdmap epoch 116
[root@MGMT11:42:36:~]# crushtool -d /tmp/crushmap -o /tmp/crushmap.txt
[root@MGMT11:42:36:~]# cat /tmp/crushmap.txt
# begin crush map
tunable choose_local_tries 0
tunable choose_local_fallback_tries 0
tunable choose_total_tries 50
tunable chooseleaf_descend_once 1
tunable chooseleaf_vary_r 1
tunable straw_calc_version 1
tunable allowed_bucket_algs 54
# devices
device 0 osd.0
device 1 osd.1
device 2 osd.2
device 3 osd.3
device 4 osd.4
device 5 osd.5
# types
type 0 osd
type 1 ssd_osd
type 2 hdd_osd
type 3 root
# buckets
root default {
id -1 # do not change unnecessarily
# weight 59.066 # the default bucket I had deleted suddenly reappeared and took all the devices with it...
alg straw2
hash 0 # rjenkins1
item osd.5 weight 0.452
item osd.4 weight 0.454
item osd.3 weight 14.540
item osd.2 weight 14.540
item osd.1 weight 14.540
item osd.0 weight 14.540
}
hdd_osd OSD-0 {
id -2 # do not change unnecessarily
# weight 0.000
alg straw
hash 0 # rjenkins1
}
hdd_osd OSD-1 {
id -3 # do not change unnecessarily
# weight 0.000
alg straw
hash 0 # rjenkins1
}
hdd_osd OSD-2 {
id -4 # do not change unnecessarily
# weight 0.000
alg straw
hash 0 # rjenkins1
}
hdd_osd OSD-3 {
id -5 # do not change unnecessarily
# weight 0.000
alg straw
hash 0 # rjenkins1
}
ssd_osd OSD-20 {
id -6 # do not change unnecessarily
# weight 0.000
alg straw
hash 0 # rjenkins1
}
ssd_osd OSD-21 {
id -7 # do not change unnecessarily
# weight 0.000
alg straw
hash 0 # rjenkins1
}
root hdd {
id -8 # do not change unnecessarily
# weight 0.000
alg straw
hash 0 # rjenkins1
item OSD-0 weight 0.000
item OSD-1 weight 0.000
item OSD-2 weight 0.000
item OSD-3 weight 0.000
}
root ssd {
id -9 # do not change unnecessarily
# weight 0.000
alg straw
hash 0 # rjenkins1
item OSD-20 weight 0.000
item OSD-21 weight 0.000
}
# rules
rule hdd {
ruleset 0
type replicated
min_size 1
max_size 10
step take hdd
step chooseleaf firstn 0 type hdd_osd
step emit
}
rule ssd {
ruleset 1
type replicated
min_size 1
max_size 10
step take ssd
step chooseleaf firstn 0 type ssd_osd
step emit
}
# end crush map

* At this point I was more than a little flustered... If you are reading this blog, don't panic: re-apply the map you applied the first time. Still broken?? Mine failed "again" too.. haha. Looking more closely, the map had been applied with the OSD entries under # devices and the "item osd.4 weight 0.454" style lines (OSD name and weight) under each bucket stripped out. Put those entries back into the crushmap and apply it again and everything comes back to normal, so please don't conclude it is unrecoverable and wipe the cluster and reinstall..
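* A quick way to confirm exactly which lines were lost is to pull the live map again and diff it against the text map you intended to apply (a sketch; the file names are just examples):

ceph osd getcrushmap -o /tmp/crushmap.live
crushtool -d /tmp/crushmap.live -o /tmp/crushmap.live.txt
diff -u /tmp/crushmap.txt /tmp/crushmap.live.txt    # the dropped "item osd.N weight ..." lines show up as removals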
[ Editing and restoring the crushmap ]
i. Edit the crushmap
[root@MGMT01:04:11:~]# vi /tmp/crushmap.txt
# begin crush map
tunable choose_local_tries 0
tunable choose_local_fallback_tries 0
tunable choose_total_tries 50
tunable chooseleaf_descend_once 1
tunable chooseleaf_vary_r 1
tunable straw_calc_version 1
tunable allowed_bucket_algs 54
# devices
device 0 osd.0
device 1 osd.1
device 2 osd.2
device 3 osd.3
device 4 osd.4
device 5 osd.5
# types
type 0 osd
type 1 ssd_osd
type 2 hdd_osd
type 3 root
# buckets
hdd_osd OSD-0 {
id -10 # do not change unnecessarily
# weight 14.540
alg straw
hash 0 # rjenkins1
item osd.0 weight 14.540
}
hdd_osd OSD-1 {
id -11 # do not change unnecessarily
# weight 14.540
alg straw
hash 0 # rjenkins1
item osd.1 weight 14.540
}
hdd_osd OSD-2 {
id -12 # do not change unnecessarily
# weight 14.540
alg straw
hash 0 # rjenkins1
item osd.2 weight 14.540
}
hdd_osd OSD-3 {
id -13 # do not change unnecessarily
# weight 14.540
alg straw
hash 0 # rjenkins1
item osd.3 weight 14.540
}
root hdd {
id -1 # do not change unnecessarily
# weight 58.160
alg straw
hash 0 # rjenkins1
item OSD-0 weight 14.540
item OSD-1 weight 14.540
item OSD-2 weight 14.540
item OSD-3 weight 14.540
}
ssd_osd OSD-20 {
id -20 # do not change unnecessarily
# weight 0.454
alg straw
hash 0 # rjenkins1
item osd.4 weight 0.454
}
ssd_osd OSD-21 {
id -21 # do not change unnecessarily
# weight 0.454
alg straw
hash 0 # rjenkins1
item osd.5 weight 0.454
}
root ssd {
id -2 # do not change unnecessarily
# weight 0.908
alg straw
hash 0 # rjenkins1
item OSD-20 weight 0.454
item OSD-21 weight 0.454
}
# rules
rule hdd {
ruleset 0
type replicated
min_size 1
max_size 10
step take hdd
step chooseleaf firstn 0 type hdd_osd
step emit
}
rule ssd {
ruleset 1
type replicated
min_size 1
max_size 10
step take ssd
step chooseleaf firstn 0 type ssd_osd
step emit
}
# end crush map
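* Before injecting the recompiled map (step ii below), it can be worth dry-running it with crushtool to confirm that both rules really map replicas onto the intended OSDs (a sketch; the rule ids match the rulesets above, and the replica count and sample range are just examples):

crushtool -c /tmp/crushmap.txt -o /tmp/crushmap.test
crushtool -i /tmp/crushmap.test --test --show-mappings --rule 0 --num-rep 2 --min-x 0 --max-x 9   # hdd rule, expect osd.0-3
crushtool -i /tmp/crushmap.test --test --show-mappings --rule 1 --num-rep 2 --min-x 0 --max-x 9   # ssd rule, expect osd.4-5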
ii. Restore the crushmap
[root@MGMT10:51:15:~]# crushtool -c /tmp/crushmap.txt -o /tmp/crushmap-new.bin
[root@MGMT11:13:45:~]# crushtool -c /tmp/crushmap.txt -o /tmp/crushmap.coloc
[root@MGMT11:14:19:~]# ceph osd setcrushmap -i /tmp/crushmap.coloc
[root@MGMT01:09:13:~]# ceph osd tree
ID WEIGHT TYPE NAME UP/DOWN REWEIGHT PRIMARY-AFFINITY
-2 0.90799 root ssd
-20 0.45399 ssd_osd OSD-20
4 0.45399 osd.4 up 1.00000 1.00000
-21 0.45399 ssd_osd OSD-21
5 0.45399 osd.5 up 1.00000 1.00000
-1 58.15997 root hdd
-10 14.53999 hdd_osd OSD-0
0 14.53999 osd.0 up 1.00000 1.00000
-11 14.53999 hdd_osd OSD-1
1 14.53999 osd.1 up 1.00000 1.00000
-12 14.53999 hdd_osd OSD-2
2 14.53999 osd.2 up 1.00000 1.00000
-13 14.53999 hdd_osd OSD-3
3 14.53999 osd.3 up 1.00000 1.00000
[root@MGMT01:11:03:~]# ceph -s
cluster 427f2e6a-5722-4365-a475-8fcdc218a418
health HEALTH_OK
monmap e2: 4 mons at {MON-0=192.168.1.13:6789/0,MON-1=192.168.1.14:6789/0,MON-2=192.168.1.15:6789/0,MON-3=192.168.1.16:6789/0}
election epoch 6, quorum 0,1,2,3 MON-0,MON-1,MON-2,MON-3
osdmap e125: 6 osds: 6 up, 6 in
flags sortbitwise,require_jewel_osds
pgmap v424: 256 pgs, 2 pools, 0 bytes data, 0 objects
667 MB used, 60483 GB / 60484 GB avail
256 active+clean
* Restore complete.
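* Finally, since the symptom can apparently recur, it may be worth keeping a compiled copy of the known-good map on hand so it can be re-injected in one step (a sketch; the path is just an example):

ceph osd getcrushmap -o /root/crushmap.known-good   # snapshot taken while the cluster is HEALTH_OK
# if the item entries ever disappear again:
ceph osd setcrushmap -i /root/crushmap.known-good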