unassigned shards 발생 원인 및 해결방법

개요

kibana에서 elasticsearch status를 확인해보니 red가 되었음을 확인하였다.
status red란 primary shards 중 unassigned 된 것이 있다는 의미이다.
원인은 node 수에 비해 지나치게 많은 인덱스가 생성되어 unassigned 된 것이었다.
node 수를 늘리거나 불필요한 인덱스를 제거하여 해결할 수 있다.
replica 수 변경으로 샤드 할당에 실패한 것일 수도 있으니 replica 수도 조정한다.

elasticsearch 상태 확인

  # 결과에서 unassinged_shards를 확인한다.
  curl -XGET "localhost:9200/_cluster/health?pretty"

  # unassigned 샤드 상태 조회(assigned shards 들을 확인할 수 있다.)
  curl "https://localhost:9200/_cat/shards" | grep "UNASSIGNED"

문제 샤드 발생 원인 탐색

  curl -XGET "localhost:9200/_cluster/allocation/explain?pretty"

대표적인 문제 샤드 발생 원인

thorttle
- 에러 원문
  - “explanation” : “reached the limit of ongoing initial primary recoveries [4], cluster setting [cluster.routing.allocation.node_initial_primaries_recoveries=4]”
- 현상
  - 노드 수 대비 너무 많은 샤드가 생성된 것
  - elasticsearch 재시작 시 일시적으로 걸리는 경우가 있다.
- 해결방법
  - 노드 수를 늘리거나 인덱스 수를 줄인다.

same shard

에러 원문
- “explanation” : “the shard cannot be allocated to the same node on which a copy of the shard already exists [[my-index-2022.06.13][4], node[OnXt_GFVQzeLNXSDFJKLSDNF], [P], s[STARTED], a[id=4daKbROAJDKFHSDJKFBKJS]]”
현상
- replica 수가 node 수보다 많아 같은 node에 같은 샤드가 할당된 경우

해결방법

replica 수를 0으로 바꾸고, 모든 샤드를 재할당한 후 다시 replicas 수를 조정한다.

명령

  # replica 수 0으로 변경
  curl -XPUT \
      "http://localhost:9200/_settings" \
      -H 'Content-Type: application/json' \
      -d '{
          "index" : {
              "number_of_replicas" : 0
          }
      }'
                
  # replica 수 변경되었는지 확인
  curl "localhost:9200/_cluster/stats?pretty" | grep "replication"

조치 후 샤드 재할당

  # 자동 할당 설정 활성화
  curl -XPUT 'http://localhost:9200/_cluster/settings' \
      -H 'Content-Type: application/json' \
      -d '{
          "transient" : {
              "cluster.routing.allocation.enable" : "all"
          }
      }'

  # 샤드 재할당
  curl -XPOST "http://localhost:9200/_cluster/reroute?retry_failed"

  # unassigned shards가 존재하는지 확인
  curl "https://localhost:9200/_cat/shards" | grep "UNASSIGNED"

unassigned shards 발생 원인 및 해결방법

개요

elasticsearch 상태 확인

문제 샤드 발생 원인 탐색

대표적인 문제 샤드 발생 원인

조치 후 샤드 재할당

참고

Trending Tags

unassigned shards 발생 원인 및 해결방법

개요

elasticsearch 상태 확인

문제 샤드 발생 원인 탐색

대표적인 문제 샤드 발생 원인

조치 후 샤드 재할당

참고

Further Reading

우분투에 ELK(Elasticsearch, Logstash, Kibana) 구축해보기

docker container 환경에서 ELK(Elasticsearch, Logstash, Kibana)로 로깅해보기

circuit_breaking_exception, [parent] Data too large 에러

Trending Tags