ceph-mon 無法在 rasberry pi (raspbian 8.0) 上啟動
ceph-mon 無法在 rasberry pi (raspbian 8.0) 上啟動
我最近購買了 3 個樹莓派節點來創建一個小型儲存集群,以便在家中進行測試。我發現了一些設置程序,所以看起來人們已經成功地做到了!
我正在執行 Raspbian GNU/Linux 8.0 (jessie)。我正在使用 ceph-deploy 安裝集群,它似乎安裝了 ceph ARM 軟體包的 10.2.5-7.2+rpi1 版本。
當我嘗試啟動 ceph-mon 服務時,我從 systemd 收到以下錯誤:
Dec 14 19:59:46 ceph-master systemd[1]: Starting Ceph cluster monitor daemon... Dec 14 19:59:46 ceph-master systemd[1]: Started Ceph cluster monitor daemon. Dec 14 19:59:47 ceph-master ceph-mon[28237]: *** Caught signal (Segmentation fault) ** Dec 14 19:59:47 ceph-master ceph-mon[28237]: in thread 756a5c30 thread_name:admin_socket Dec 14 19:59:47 ceph-master systemd[1]: ceph-mon@ceph-master.service: main process exited, code=killed, status=11/SEGV Dec 14 19:59:47 ceph-master systemd[1]: Unit ceph-mon@ceph-master.service entered failed state. Dec 14 19:59:47 ceph-master systemd[1]: ceph-mon@ceph-master.service holdoff time over, scheduling restart. Dec 14 19:59:47 ceph-master systemd[1]: Stopping Ceph cluster monitor daemon... Dec 14 19:59:47 ceph-master systemd[1]: Starting Ceph cluster monitor daemon... Dec 14 19:59:47 ceph-master systemd[1]: Started Ceph cluster monitor daemon. Dec 14 19:59:49 ceph-master ceph-mon[28256]: *** Caught signal (Segmentation fault) ** Dec 14 19:59:49 ceph-master ceph-mon[28256]: in thread 75654c30 thread_name:admin_socket Dec 14 19:59:49 ceph-master ceph-mon[28256]: ceph version 10.2.5 (c461ee19ecbc0c5c330aca20f7392c9a00730367) Dec 14 19:59:49 ceph-master ceph-mon[28256]: 1: (()+0x4b1348) [0x54fae348] Dec 14 19:59:49 ceph-master ceph-mon[28256]: 2: (__default_sa_restorer()+0) [0x768bb480] Dec 14 19:59:49 ceph-master ceph-mon[28256]: 3: (AdminSocket::do_accept()+0x28) [0x550ca154] Dec 14 19:59:49 ceph-master ceph-mon[28256]: 4: (AdminSocket::entry()+0x22c) [0x550cc458] Dec 14 19:59:49 ceph-master systemd[1]: ceph-mon@ceph-master.service: main process exited, code=killed, status=11/SEGV Dec 14 19:59:49 ceph-master systemd[1]: Unit ceph-mon@ceph-master.service entered failed state. Dec 14 19:59:49 ceph-master systemd[1]: ceph-mon@ceph-master.service holdoff time over, scheduling restart. Dec 14 19:59:49 ceph-master systemd[1]: Stopping Ceph cluster monitor daemon... Dec 14 19:59:49 ceph-master systemd[1]: Starting Ceph cluster monitor daemon... Dec 14 19:59:49 ceph-master systemd[1]: Started Ceph cluster monitor daemon. Dec 14 19:59:50 ceph-master ceph-mon[28271]: *** Caught signal (Segmentation fault) ** Dec 14 19:59:50 ceph-master ceph-mon[28271]: in thread 755fcc30 thread_name:admin_socket Dec 14 19:59:50 ceph-master systemd[1]: ceph-mon@ceph-master.service: main process exited, code=killed, status=11/SEGV Dec 14 19:59:50 ceph-master systemd[1]: Unit ceph-mon@ceph-master.service entered failed state. Dec 14 19:59:50 ceph-master systemd[1]: ceph-mon@ceph-master.service holdoff time over, scheduling restart. Dec 14 19:59:50 ceph-master systemd[1]: Stopping Ceph cluster monitor daemon... Dec 14 19:59:50 ceph-master systemd[1]: Starting Ceph cluster monitor daemon... Dec 14 19:59:50 ceph-master systemd[1]: ceph-mon@ceph-master.service start request repeated too quickly, refusing to start. Dec 14 19:59:50 ceph-master systemd[1]: Failed to start Ceph cluster monitor daemon. Dec 14 19:59:50 ceph-master systemd[1]: Unit ceph-mon@ceph-master.service entered failed state.
我在這裡尋找指導,因為我不確定為什麼這不起作用。我正在為我的 apt 儲存庫使用以下 URL:
root@ceph-master:~# cat /etc/apt/sources.list deb http://mirrordirector.raspbian.org/raspbian/ testing main contrib non-free rpi root@ceph-master:~# cat /etc/apt/sources.list.d/ceph.list deb https://download.ceph.com/debian-jewel/ jessie main
有沒有其他人嘗試過這個並且遇到過類似的問題?有關如何繼續或解決此問題的任何建議?
我終於讓 ceph 在我的 Raspberry Pi 3s 上工作。Ceph 儲存庫只有用於 arm64 架構的 ARM 包。所有 armhf 包都不見了。我認為 ceph 社區不再關心 32 位,而一直專注於 64 位。訣竅是安裝 Ubuntu 的 arm64 版本。download.ceph.com 上唯一可用的 arm64 軟體包用於 xenial 或 trusty。在多次嘗試安裝 64 位版本的 ubuntu 失敗後,我使用了 64 位版本的 debian,只是將儲存庫更改為指向:
deb http://ports.ubuntu.com/ubuntu-ports xenial main universe deb http://ports.ubuntu.com/ubuntu-ports xenial-updates main universe deb http://ports.ubuntu.com/ubuntu-ports xenial-security main universe
我在這裡找到了 64 位 rpi debian 映像:
https://github.com/bamarni/pi64/releases
我已經啟動並執行了 ceph luminous!
ceph-admin@ceph-node0:~/my-cluster$ sudo ceph -s cluster: id: 5fa3e5e2-04ea-407e-b6f1-8ef5a3b3b1f2 health: HEALTH_OK services: mon: 1 daemons, quorum ceph-node0 mgr: ceph-node0(active) osd: 3 osds: 3 up, 3 in data: pools: 0 pools, 0 pgs objects: 0 objects, 0 bytes usage: 3164 MB used, 168 GB / 171 GB avail pgs:
Dec 14 19:59:47 ceph-master ceph-mon[28237]: *** Caught signal (Segmentation fault) **
那是程式錯誤。ceph 正在訪問指向涅槃的指針。
正如@SYN 所說 - 使用更新版本的 ceph 可能會更幸運,其中可能會修復此錯誤。如果沒有,那麼您應該將錯誤以及上游日誌中可見的堆棧跟踪報告給 ceph 開發人員 - 當錯誤被擷取並修復時,每個人都應該很高興,尤其是在儲存軟體中。