Jenanaputra
Published under the MIT license

Autonomous Salinity Control on Coral Nursery Systems

A control system based on Reinforcement Learning (RL)


Story


Code

All dependencies

Python
import gymnasium as gym
import numpy as np
import matplotlib.pyplot as plt
from gymnasium import spaces
from stable_baselines3 import PPO
from stable_baselines3.common.env_util import make_vec_env


# Custom Gymnasium environment modelling pond salinity dynamics
class PondSalinityEnv(gym.Env):
    def __init__(self):
        super().__init__()

        # Observation space: salinity in ppt
        self.observation_space = spaces.Box(
            low=0, high=45, shape=(1,), dtype=np.float32
        )

        # Action space: 3 discrete inflow settings, mapped to flow
        # rates in mapping_value() below
        self.action_space = spaces.Discrete(3)

        self.current_step = 0
        self.max_step = 200
        self.salinity = None

        # System parameters
        self.dt = 1.0      # time step [h]
        self.V = 1000.0    # pond volume [m^3]
        self.C_in = 35.0   # inflow salinity [ppt]
        self.S_env = 34.5  # ambient (setpoint) salinity [ppt]
        self.k = 0.05      # relaxation rate toward S_env [1/h]

    def reset(self, seed=None, options=None):
        super().reset(seed=seed)
        # Use the seeded RNG from gymnasium so episodes are reproducible
        self.salinity = float(self.np_random.uniform(34.25, 34.75))
        self.current_step = 0
        return np.array([self.salinity], dtype=np.float32), {}

    def mapping_value(self, action):
        # Map the discrete action to an inflow rate Q_in [m^3/h]
        return {0: 0.0, 1: 0.1, 2: 0.5}[action]

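    # Plant model used by step() below: a single-state salt mass balance,
    #     dS/dt = Q_in * C_in / V - k * (S - S_env)
    # where the first term is salt carried in by the inflow Q_in and the
    # second term relaxes the pond toward the ambient salinity S_env.
    # step() integrates this with a forward-Euler step of size dt.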
    def step(self, action):
        Q_in = self.mapping_value(action)
        dS = (Q_in * self.C_in / self.V) - self.k * (self.salinity - self.S_env)
        self.salinity += self.dt * dS
        self.salinity = float(np.clip(self.salinity, 0, 45))

        obs = np.array([self.salinity], dtype=np.float32)
        # Reward: negative absolute deviation from the setpoint salinity
        reward = -abs(self.salinity - self.S_env)

        self.current_step += 1
        # Hitting the time limit is a truncation, not a termination,
        # under the Gymnasium API
        terminated = False
        truncated = self.current_step >= self.max_step
        return obs, reward, terminated, truncated, {}

    def render(self):
        print(f"Step={self.current_step} | Salinity={self.salinity:.2f} ppt")

    def close(self):
        pass
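
# Optional: a quick sanity check of the environment with random actions
# before training (illustrative only; the 5-step horizon is arbitrary)
check_env = PondSalinityEnv()
obs, _ = check_env.reset(seed=0)
for _ in range(5):
    obs, reward, terminated, truncated, _ = check_env.step(
        check_env.action_space.sample()
    )
    check_env.render()
check_env.close()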
      
#########################################################      

# for model training
# Vectorize the environment (SB3 can wrap a single env automatically,
# but an explicit vec env keeps the setup uniform)
env = make_vec_env(lambda: PondSalinityEnv(), n_envs=1)

# Define PPO model
model = PPO(
    "MlpPolicy",           # a small MLP policy is enough for a 1-D observation
    env,
    verbose=1,
    learning_rate=3e-4,
    gamma=0.99,            # discount factor
    n_steps=2048,          # rollout length per policy update
    batch_size=64,
    ent_coef=0.01          # entropy bonus to encourage exploration
)

# Train
model.learn(total_timesteps=500_000)

# Save model
model.save("ppo_salinity_1D")
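
# Optional: quick policy evaluation (evaluate_policy is part of
# stable_baselines3; 10 episodes here is an arbitrary illustrative choice)
from stable_baselines3.common.evaluation import evaluate_policy
mean_reward, std_reward = evaluate_policy(model, env, n_eval_episodes=10)
print(f"Mean episodic reward: {mean_reward:.2f} +/- {std_reward:.2f}")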

env.close()
#######################################


# for testing the model
# Load model
model = PPO.load("ppo_salinity_1D")

# Create new env instance (not vec_env here)
env = PondSalinityEnv()

# Reset env
obs, _ = env.reset()
print("Initial observation:", obs)

# Logs
salinity_log = []
setpoint_log = []
time_log = []

setpoint = 34.5
n = 200  # equals the environment's max_step, so one full episode

for t in range(n):
    action, _ = model.predict(obs, deterministic=True)
    obs, reward, terminated, truncated, info = env.step(int(action))

    salinity_log.append(obs[0])   # obs is a shape-(1,) array
    setpoint_log.append(setpoint)
    time_log.append(t)

    if terminated or truncated:
        break

# Plot
plt.figure(figsize=(10,5))
plt.plot(time_log, salinity_log, label="Actual Salinity")
plt.plot(time_log, setpoint_log, 'r--', label="Desired Salinity (Setpoint)")
plt.xlabel("Time step")
plt.ylabel("Salinity (ppt)")
plt.title("Pond Salinity Control with RL Agent (1D state)")
plt.legend()
plt.grid(True)
plt.show()
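
# Optional: summarize tracking error numerically (a simple sketch; MAE and
# RMSE against the 34.5 ppt setpoint are computed from the logs above)
errors = np.array(salinity_log) - setpoint
print(f"MAE:  {np.mean(np.abs(errors)):.4f} ppt")
print(f"RMSE: {np.sqrt(np.mean(errors ** 2)):.4f} ppt")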

########################################

Credits

Jenanaputra
An underwater robotics developer who is passionate about marine things.
