resources:
limits:
cpu: "2"
memory: 8000Mi --> 16000Mi
requests:
cpu: "2"
memory: 8000Mi --> 16000Mi
ts=2023-05-18T01:00:36.619Z caller=head.go:493 level=info component=tsdb msg="Replaying on-disk memory mappable chunks if any"
ts=2023-05-18T01:00:36.670Z caller=head.go:536 level=info component=tsdb msg="On-disk memory mappable chunks replay completed" duration=51.105706ms
ts=2023-05-18T01:00:36.670Z caller=head.go:542 level=info component=tsdb msg="Replaying WAL, this may take a while"
ts=2023-05-18T01:00:46.073Z caller=head.go:578 level=info component=tsdb msg="WAL checkpoint loaded"
ts=2023-05-18T01:00:48.380Z caller=head.go:613 level=info component=tsdb msg="WAL segment loaded" segment=3945 maxSegment=3948
ts=2023-05-18T01:00:54.070Z caller=head.go:613 level=info component=tsdb msg="WAL segment loaded" segment=3946 maxSegment=3948
ts=2023-05-18T01:00:54.175Z caller=head.go:613 level=info component=tsdb msg="WAL segment loaded" segment=3947 maxSegment=3948
ts=2023-05-18T01:00:54.176Z caller=head.go:613 level=info component=tsdb msg="WAL segment loaded" segment=3948 maxSegment=3948
ts=2023-05-18T01:00:54.176Z caller=head.go:619 level=info component=tsdb msg="WAL replay completed" checkpoint_replay_duration=9.402768009s wal_replay_duration=8.103580105s total_replay_duration=17.557514431s
ts=2023-05-18T01:00:54.733Z caller=main.go:991 level=warn fs_type=NFS_SUPER_MAGIC msg="This filesystem is not supported and may lead to data corruption and data loss. Please carefully read https://prometheus.io/docs/prometheus/latest/storage/ to learn more about supported filesystems."
ts=2023-05-18T01:00:54.733Z caller=main.go:996 level=info msg="TSDB started"
ts=2023-05-18T01:00:54.734Z caller=main.go:1177 level=info msg="Loading configuration file" filename=/etc/config/prometheus.yml
ts=2023-05-18T01:00:54.736Z caller=kubernetes.go:325 level=info component="discovery manager scrape" discovery=kubernetes msg="Using pod service account via in-cluster config"
ts=2023-05-18T01:00:54.737Z caller=kubernetes.go:325 level=info component="discovery manager scrape" discovery=kubernetes msg="Using pod service account via in-cluster config"
ts=2023-05-18T01:00:54.737Z caller=kubernetes.go:325 level=info component="discovery manager scrape" discovery=kubernetes msg="Using pod service account via in-cluster config"
ts=2023-05-18T01:00:54.737Z caller=kubernetes.go:325 level=info component="discovery manager scrape" discovery=kubernetes msg="Using pod service account via in-cluster config"
ts=2023-05-18T01:00:54.737Z caller=kubernetes.go:325 level=info component="discovery manager notify" discovery=kubernetes msg="Using pod service account via in-cluster config"
ts=2023-05-18T01:00:54.737Z caller=main.go:1214 level=info msg="Completed loading of configuration file" filename=/etc/config/prometheus.yml totalDuration=3.935451ms db_storage=1.792µs remote_storage=4.519µs web_handler=676ns query_engine=1.473µs scrape=277.815µs scrape_sd=1.094468ms notify=72.292µs notify_sd=174.763µs rules=30.14µs tracing=10.05µs
ts=2023-05-18T01:00:54.737Z caller=main.go:957 level=info msg="Server is ready to receive web requests."
ts=2023-05-18T01:00:54.738Z caller=manager.go:937 level=info component="rule manager" msg="Starting rule manager..."
→ 정상 기동 이후 재기동 시, 비정상 종료했을 때보다 더 적은 WAL Segment가 load 되는 것을 확인 가능
https://engineering.linecorp.com/ko/blog/prometheus-container-kubernetes-cluster
https://stackoverflow.com/questions/63541085/kubernetes-prometheus-crashloopbackoff-oomkilled-puzzle
https://blog.naver.com/PostView.nhn?blogId=alice_k106&logNo=221829384846