
Distributed regridding v2 - source data on distributed space #1175



Open · wants to merge 1 commit into main

Conversation

juliasloan25 (Member)

This is the second PR of ClimaCoupler SDI #188, the first being #1107.

Major goals to accomplish in this PR

  • Eliminate the use of serial spaces in distributed remapping functions.
  • Use MPI to share information between processes, rather than constructing objects on serial spaces and broadcasting them as before.

Specific changes to be implemented in this PR

  • Generate source data on the distributed source space only. Send all source data to all processes using DSS.
    • This is not the ideal implementation, since we send more information than necessary. It will be improved in the future, but it is a good next step at this point.
  • Add a source_global_elem_lidx field to LinearMap. Use it in remap! to perform the matrix multiplication only over local indices of the source data (see the LinearMap sketch after this list). Since at this point all processes have all the source data, this approach includes redundant multiplication; however, it will be useful once we use the super-halo exchange to send only the necessary source data.
  • Create two methods for the generate_map function: one for the serial case and one for the distributed case (see the dispatch sketch after this list).
  • In the distributed case of generate_map, use only the distributed source and target spaces (i.e. no serial spaces).
    • Extend Spaces.unique_nodes to correctly return the number of unique nodes in a distributed space.
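
As a concrete illustration of the source_global_elem_lidx idea, here is a minimal sketch of how a global-to-local index lookup could drive the weight application in remap!. The field layout, loop structure, and element types below are assumptions for illustration only; they do not reflect ClimaCore's actual LinearMap definition.

```julia
# Sketch only: hypothetical LinearMap layout with a global-to-local index map.
struct LinearMap{W <: AbstractVector}
    weights::W                  # remapping weight for each (target, source) pair
    target_idxs::Vector{Int}    # target node index for each weight
    source_idxs::Vector{Int}    # global source node index for each weight
    source_global_elem_lidx::Dict{Int, Int}  # global source index -> local index
end

function remap!(target::AbstractVector, R::LinearMap, source_local::AbstractVector)
    fill!(target, zero(eltype(target)))
    for k in eachindex(R.weights)
        # Only apply weights whose source node is present locally; once the
        # super-halo exchange sends only the necessary data, this lookup
        # naturally skips everything the process does not hold.
        lidx = get(R.source_global_elem_lidx, R.source_idxs[k], nothing)
        lidx === nothing && continue
        target[R.target_idxs[k]] += R.weights[k] * source_local[lidx]
    end
    return target
end
```

With the current approach (all source data on every process), every global index appears in source_global_elem_lidx, so the multiplication is redundant across processes but still correct.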
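
The split into serial and distributed generate_map methods could be expressed with multiple dispatch on the communications context, roughly as sketched below. The ClimaComms context types are real, but the accessor used to obtain the context from a space and the method bodies are placeholders, not the planned implementation.

```julia
import ClimaComms

# Serial case: both spaces are global, so weights can be built directly.
function generate_map(::ClimaComms.SingletonCommsContext, target_space, source_space)
    # ... existing serial weight generation ...
end

# Distributed case: use only the distributed spaces; any cross-process
# information is obtained via MPI rather than via serial spaces.
function generate_map(ctx::ClimaComms.MPICommsContext, target_space, source_space)
    # ... build weights from local elements plus MPI-shared metadata ...
end

# Entry point dispatches on the context of the source space (accessor assumed).
generate_map(target_space, source_space) =
    generate_map(ClimaComms.context(source_space), target_space, source_space)
```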

The most involved component of this PR will be extending Spaces.unique_nodes to handle a distributed space as input. This requires developing an algorithm to count the unique nodes across processes and then implementing it. It may be best done as a separate PR to keep the scope of each PR small.
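
One possible counting strategy, sketched below under the assumption that each shared node has a single owning process (e.g. as determined by DSS ownership information): each process counts the unique nodes it owns, and the per-process counts are summed with an all-reduce. The helper and its arguments are hypothetical; this is not ClimaCore's Spaces.unique_nodes.

```julia
import MPI

# Count unique nodes of a distributed space by summing, over all processes,
# the number of unique nodes each process owns. `local_node_ids` are unique
# identifiers (e.g. global node IDs) for the local nodes, and `is_owned[i]`
# marks whether this process owns node i, so nodes shared on inter-process
# boundaries are counted exactly once.
function distributed_unique_node_count(comm::MPI.Comm,
                                       local_node_ids::AbstractVector,
                                       is_owned::AbstractVector{Bool})
    owned_unique = Set(id for (id, own) in zip(local_node_ids, is_owned) if own)
    return MPI.Allreduce(length(owned_unique), +, comm)
end
```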

The other non-trivial component of this PR is sending the source data to all processes. We should be able to reuse the existing DSS code, but we may need to add some data structures and functions to our code to use it. We should develop this part with the super-halo implementation in mind, so that any relevant infrastructure can easily be extended to that case.
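
For orientation, the data movement in this step amounts to every process ending up with the full source array. A generic MPI all-gather showing that movement is sketched below; since the PR intends to reuse ClimaCore's DSS machinery for the exchange, this is only an illustration of the communication pattern, not the planned implementation.

```julia
import MPI

# Gather each process's local chunk of source data onto every process.
# This mimics the "all source data on all processes" state described above;
# the real exchange is expected to go through the existing DSS code instead.
function gather_source_everywhere(comm::MPI.Comm, local_source::Vector{Float64})
    counts = MPI.Allgather(Int32[length(local_source)], comm)  # chunk sizes per rank
    global_source = Vector{Float64}(undef, sum(counts))
    MPI.Allgatherv!(local_source, MPI.VBuffer(global_source, counts), comm)
    return global_source
end
```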

QA

  • Code follows the style guidelines OR N/A.
  • Unit tests are included OR N/A.
  • Code is exercised in an integration test OR N/A.
  • Documentation has been added/updated OR N/A.
