Memory termination #476

naftaly · 2024-05-12T23:09:02Z

Adding OOM support to KSCrash.

In a gist, this works by writing an OOM report to disk outside of the report area for later modification. We also keep track of memory information through a memory mapped file for crash resistance, we write to it anytime memory limit or pressure changes. On termination, that data is basically a breadcrumb we can use on cold start.

On cold start, we look for that mapped file, if memory pressure is >= critical or memory level is >= critical it means we were terminated due to memory. At this point we'll read in the OOM report we write to disk on the previous app launch, modify it to fit our needs with latest information and move it to the report folder where the rest of the system will pick it up and send it in.

One thing that would make everything better is to have new repositories for each cold start. it can make handling files a lot more robust.

Sources/KSCrashRecording/KSCrashAppMemory.mm

Sources/KSCrashRecording/Monitors/KSCrashMonitor_Memory.mm

Sources/KSCrashRecording/Monitors/KSCrashMonitor_Memory.h

GLinnik21 · 2024-05-13T21:45:38Z

I'm really impressed by the clever way you've used memory-mapped files! It's a great solution. I just wanted to check in about something. How can we be sure that data cached in memory is going to be written to disc by the kernel in case of a crash?

naftaly · 2024-05-15T16:31:55Z

I'm really impressed by the clever way you've used memory-mapped files! It's a great solution. I just wanted to check in about something. How can we be sure that data cached in memory is going to be written to disc by the kernel in case of a crash?

The way I understand it is that mmap is handled in kernel space and not user space, which leads to file based mmap to basically be a “kernel cache” that is always dumped to disk based on the mapping sync functions, and a process being terminated is one of those instances where the kernel will explicitly call those sync functions, which leads to crash resilience.

naftaly · 2024-05-17T23:33:10Z

@GLinnik21 Can you reopen the PR so I can change the base to main and we can continue work on it?

GLinnik21 · 2024-05-17T23:38:12Z

Oh sorry. It got automatically closed because I merged and deleted release-2.0 to master. I can try to restore release-2.0 or you can try changing the target branch to kstenerud:master. What works better for you?

naftaly · 2024-05-17T23:41:51Z

I wasn't able to change the base for some reason, likely because the PR is closed.

naftaly · 2024-05-17T23:42:06Z

Oh, there we go :) thank you.

Sources/KSCrashRecording/Monitors/KSCrashMonitor_Memory.m

Sources/KSCrashRecording/include/KSCrashAppMemory.h

Sources/KSCrashRecording/include/KSCrashMonitorType.h

GLinnik21 · 2024-05-19T14:26:57Z

Could you add some tests to cover the new functionality where possible? It would help maintain the quality and stability of the project. Thanks!

naftaly · 2024-05-19T19:47:29Z

Could you add some tests to cover the new functionality where possible? It would help maintain the quality and stability of the project. Thanks!

I've added a few tests. If there's anything in particular you'd like to be tested feel free to point it out, I'm happy to add more.

Sources/KSCrashRecording/Monitors/KSCrashMonitor_Memory.h

Sources/KSCrashRecording/Monitors/KSCrashMonitor_Memory.m

Sources/KSCrashRecording/Monitors/KSCrashMonitor_NSException.m

Sources/KSCrashRecording/include/KSCrashC.h

Sources/KSCrashRecording/include/KSCrashReportFields.h

naftaly · 2024-05-21T15:10:01Z

Any clues what’s going on with this check, I don’t see what’s wrong.

GLinnik21 · 2024-05-21T15:11:27Z

Any clues what’s going on with this check, I don’t see what’s wrong.

Let's try to rerun for now.

GLinnik21 · 2024-05-21T18:16:19Z

I wanted to share a thought on organizing our code. When we put multiple class implementations in a single file, especially in both implementation and header files, it can make the code harder to read and maintain. Keeping everything in separate files can really simplify code reading and understanding. Some of our files with multiple implementations have grown to several hundred lines of code, which can be overwhelming. What do you think about splitting them up to make things more manageable?

Update the context to have info about if we're currently in a crash handler.

We might want ot think of adding Mac Catalyst to KSCRASH_HAS_UIAPPLICATION.

enabled fatal reporting as well.

naftaly · 2024-05-24T13:53:09Z

@bamx23 @GLinnik21 I addressed the last few pieces of feedback. I found a simple way to enable/disable reporting OOMs but keep the data flowing and added to other types of reports.

I also wanted to let both of you know I wrote a small piece about what we did here. https://medium.com/@alexandercohen/reducing-memory-terminations-in-ios-apps-3e76797ca5bd

naftaly · 2024-05-24T14:59:56Z

Could you folks rerun the actions, they seem to have failed due to a timeout.

GLinnik21 · 2024-05-24T15:05:48Z

@GLinnik21 do you have anything left to discuss here or are you good to merge?

As always, the more tests, the better. If other code is too low-level to test, I'm fine with that.

GLinnik21 · 2024-05-24T15:19:36Z

I also wanted to let both of you know I wrote a small piece about what we did here. https://medium.com/@alexandercohen/reducing-memory-terminations-in-ios-apps-3e76797ca5bd

Wow! I read the whole article and found it very insightful. Thank you for the shoutout!

bamx23

Great! Let's go!

naftaly mentioned this pull request May 12, 2024

Memory Termination Support #474

Closed

GLinnik21 linked an issue May 12, 2024 that may be closed by this pull request

Memory Termination Support #474

Closed

GLinnik21 reviewed May 13, 2024

View reviewed changes

GLinnik21 deleted the branch kstenerud:master May 17, 2024 14:34

GLinnik21 closed this May 17, 2024

GLinnik21 reopened this May 17, 2024

naftaly changed the base branch from release-2.0 to master May 17, 2024 23:42

GLinnik21 reviewed May 18, 2024

View reviewed changes

Sources/KSCrashRecording/Monitors/KSCrashMonitor_Memory.m Outdated Show resolved Hide resolved

naftaly commented May 18, 2024

View reviewed changes

Sources/KSCrashRecording/Monitors/KSCrashMonitor_Memory.m Show resolved Hide resolved

GLinnik21 reviewed May 18, 2024

View reviewed changes

Sources/KSCrashRecording/include/KSCrashAppMemory.h Show resolved Hide resolved

naftaly marked this pull request as ready for review May 18, 2024 23:44

naftaly requested a review from GLinnik21 May 18, 2024 23:44

naftaly commented May 18, 2024

View reviewed changes

Sources/KSCrashRecording/include/KSCrashMonitorType.h Outdated Show resolved Hide resolved

GLinnik21 requested a review from bamx23 May 19, 2024 13:00

GLinnik21 reviewed May 20, 2024

View reviewed changes

Sources/KSCrashRecording/Monitors/KSCrashMonitor_NSException.m Outdated Show resolved Hide resolved

GLinnik21 reviewed May 20, 2024

View reviewed changes

Sources/KSCrashRecording/include/KSCrashC.h Show resolved Hide resolved

GLinnik21 reviewed May 20, 2024

View reviewed changes

Sources/KSCrashRecording/include/KSCrashReportFields.h Outdated Show resolved Hide resolved

naftaly requested a review from GLinnik21 May 20, 2024 20:02

naftaly added 22 commits May 24, 2024 09:24

Added missing include

bd67b61

Fix a BOOL to bool issue

1156f56

Cleaned up docs and passed validation

c1601ff

Cleaned up AppStateTracker and observers to make the objects weak

fda7641

Added a fatal flag

b6346cf

Update the context to have info about if we're currently in a crash handler.

Added a few tests.

78b1bcb

Fixed some tests

f4de018

Addressed a few comments

06ae16e

Move app state tracker into its own file

97160ca

Added some tests

17d0d1f

App State Tracker tests

be2b410

Added more async safety

07ba253

touch

429c866

Moved App Memory Tracker into it's own file

a50a59b

Make fatal a bool

ab8c3c3

Addressed a few comments

ed4b1d1

Call KSCrash.sharedInstance early

0888a92

Addressed some feedback.

11e2c2c

Added feedbacl

52e612b

Lifecycle events now work on all correct platforms.

dde7d88

We might want ot think of adding Mac Catalyst to KSCRASH_HAS_UIAPPLICATION.

Addressed latest feedback

1559551

enabled fatal reporting as well.

Fixed a type issue

4969928

naftaly force-pushed the Memory-Termination branch from e82109b to 4969928 Compare May 24, 2024 13:24

Addressed feedback.

8594117

bamx23 approved these changes May 24, 2024

View reviewed changes

bamx23 merged commit 992a1c6 into kstenerud:master May 24, 2024
19 checks passed

This pull request was closed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Memory termination #476

Memory termination #476

naftaly commented May 12, 2024 •

edited

Loading

GLinnik21 commented May 13, 2024

naftaly commented May 15, 2024

naftaly commented May 17, 2024

GLinnik21 commented May 17, 2024

naftaly commented May 17, 2024

naftaly commented May 17, 2024

GLinnik21 commented May 19, 2024

naftaly commented May 19, 2024

naftaly commented May 21, 2024

GLinnik21 commented May 21, 2024

GLinnik21 commented May 21, 2024

naftaly commented May 24, 2024

naftaly commented May 24, 2024

GLinnik21 commented May 24, 2024

GLinnik21 commented May 24, 2024

bamx23 left a comment

Memory termination #476

Memory termination #476

Conversation

naftaly commented May 12, 2024 • edited Loading

GLinnik21 commented May 13, 2024

naftaly commented May 15, 2024

naftaly commented May 17, 2024

GLinnik21 commented May 17, 2024

naftaly commented May 17, 2024

naftaly commented May 17, 2024

GLinnik21 commented May 19, 2024

naftaly commented May 19, 2024

naftaly commented May 21, 2024

GLinnik21 commented May 21, 2024

GLinnik21 commented May 21, 2024

naftaly commented May 24, 2024

naftaly commented May 24, 2024

GLinnik21 commented May 24, 2024

GLinnik21 commented May 24, 2024

bamx23 left a comment

Choose a reason for hiding this comment

naftaly commented May 12, 2024 •

edited

Loading