-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Chance pairs? #297
Comments
The interface looks good. I'd call I think the result should at least include pairs of IDs and times at which they crossed the distance tolerance. So perhaps the result could be a list of tuples of What is |
I am thinking of chance pairs existing in two dimensions: distance and time. As an example you can then search for particles that came within 1 km and 1 day of each other. So we might want to define a single norm based on these parameters. |
Just interested in this issue. Some papers use a range of distance to select chance pair. For example, given an initial separation I am also interested in the returning of this function. Does this return particle pairs (which is linked to this issue)? |
A-ha, so it's actually useful to search for pairs where |
That's good. But I am not sure about the time, as many Lagrangian data have uniform time intervals (e.g., 1hr or 6hr). Not sure if time range is necessary or not. Maybe it is necessary for the raw data. |
Thanks for tagging me here. I am definitely interested in this function. A few of things come to mind, which might be worth thinking about.
|
The algorithm for this is pretty simple:
of course, we don't want to implement O(N^2) algorithm so we could initially create a KD-Tree to speed up searching for nearby points. Are there any other known optimizations for this? |
Consider a |
I say AND |
I'll work on this next. |
Could we write a function that would efficiently find chance pairs in a Lagrangian dataset? @dhruvbalwada how would you like such a function to look like in terms of its input/output arguments?
Input arguments could be:
where
rowsize
is a list of integers specifying the number of data points in each row of the input ragged arrayslon
,lat
,time
and the other arguments specify the maximum spatial and temporal distances defining the chance encounters? What should be the output? Pairs of ids that would need to be passed? double indices of rows and columns in rows?The text was updated successfully, but these errors were encountered: