Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Draft: Tobi Dance Branch #12

Draft
wants to merge 43 commits into
base: main
Choose a base branch
from
Draft
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
43 commits
Select commit Hold shift + click to select a range
68bf537
Starting template for integrating vector of target positions
marcinpaluch1994 Nov 24, 2022
6a6d218
to fix tensorflow JIT compile case insentivity problem, renamed cart …
tobidelbruck Nov 27, 2022
d1d9203
docstring for update_parameters
tobidelbruck Nov 28, 2022
cc2b58c
controller_mpc.py now copies all modified controller, cost and optimi…
tobidelbruck Nov 28, 2022
06c7981
rename config_cost_function.yml to config_cost_functions.yml for cons…
tobidelbruck Nov 28, 2022
068a0ac
finally the dynamically modifiable control cost parameters are workin…
tobidelbruck Dec 11, 2022
64148a3
now spin and balance both work! and so does changing the policy and …
tobidelbruck Dec 12, 2022
fef38b0
got basic shimmy movement to work now. added helper vars to access co…
tobidelbruck Dec 12, 2022
d2e9942
added cartonly trajectory and fixed bug that erased the target positi…
tobidelbruck Dec 12, 2022
03fa3af
passing current state to cartpole_trajectory_generator.py so it can e…
tobidelbruck Dec 12, 2022
ab0f34e
added MPPI papers to docstring
tobidelbruck Dec 13, 2022
7ad7be1
added MPPI papers to docstring
tobidelbruck Dec 13, 2022
86c8d68
Rename num_rollouts -> batch_size in config templates
frehe Dec 14, 2022
3ade5d0
add comment
tobidelbruck Dec 18, 2022
56d3e51
local changes, all minor except for trajectory cost that is in flux
tobidelbruck Dec 18, 2022
9dad3a7
Merge remote-tracking branch 'origin/Tobi_Dance' into Tobi_Dance
tobidelbruck Dec 18, 2022
67ba599
renamed s to state for clariy in many of the classes.
Dec 24, 2022
1f324f4
added dancer that reads CSV file to specify sequence of 'steps' (beha…
Dec 26, 2022
0b543a0
fixed import of CompileTF to point to SI_Toolkit
tobidelbruck Jan 28, 2023
8fe2a96
Merge remote-tracking branch 'origin/main' into Tobi_Dance
tobidelbruck Jan 31, 2023
c0ee620
merged from Tobi_Dance and added some loggers
tobidelbruck Feb 3, 2023
ee9ca38
moved get_logger to own file in SI_Toolkit
tobidelbruck Feb 6, 2023
ecf2cc8
Merge remote-tracking branch 'origin/Tobi_Dance' into Tobi_Dance
tobidelbruck Feb 7, 2023
69b886c
added search path for running from physical-cartpole.
tobidelbruck Feb 7, 2023
4ce131e
update path to config_cost_functions.yml
tobidelbruck Feb 7, 2023
32c0a50
move get_logger.py to Control_Toolkit so that it can be used by physi…
tobidelbruck Feb 8, 2023
45f1d56
Merge remote-tracking branch 'origin/Tobi_Dance' into Tobi_Dance
tobidelbruck Feb 8, 2023
f0ec1d1
cartpole_dancer.py starts to work. Music starts and stops, some steps…
tobidelbruck Feb 10, 2023
5d13e63
improved control slightly by adding back more cost terms to provide s…
tobidelbruck Feb 11, 2023
e2162fd
added some docstrings, but they are not very informative
tobidelbruck Feb 12, 2023
518a00f
added prediction and target trajectory to logging to allow model mism…
tobidelbruck Feb 13, 2023
9190c2a
add computation of pole natural frequency to p_globals.py.
tobidelbruck Feb 14, 2023
6b8a676
added 'cartwheel' step to cartpole_trajectory_generator.py.
tobidelbruck Feb 16, 2023
039d385
added warning for cart calibration.
tobidelbruck Feb 19, 2023
ed2c118
fixed some logic and reduced some loggers to debug level
tobidelbruck Feb 19, 2023
a16b454
fixed shimmy math.
tobidelbruck Feb 20, 2023
000ff67
Merge remote-tracking branch 'origin/Tobi_Dance' into Tobi_Dance
tobidelbruck Feb 20, 2023
ea33a00
improved console reporting of current objective and logging output so…
tobidelbruck Feb 21, 2023
6e8ef51
improved logging output to make debug logger light gray, include file…
tobidelbruck Feb 21, 2023
e894dfc
fixed get_logger.py that now uses a single logger name to only add th…
tobidelbruck Feb 22, 2023
3489a44
reduced chatter in logging
tobidelbruck Feb 23, 2023
a1a36db
major changes to cartpole_dancer_cost and cartpole_trajectory_generat…
tobidelbruck Feb 28, 2023
807c46d
initial commit of Shreyan's code for energy-based controller for cart…
tobidelbruck May 13, 2023
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Original file line number Diff line number Diff line change
Expand Up @@ -9,7 +9,8 @@
import os

from Control_Toolkit.Controllers import template_controller
from others.globals_and_utils import create_rng
from others.globals_and_utils import create_rng, update_attributes


# TODO: You can load and access config files here, like this:
# config = yaml.load(open("config.yml", "r"), Loader=yaml.FullLoader)
Expand All @@ -31,10 +32,11 @@ def configure(self):
# u = 0.0
pass

def step(self, s: np.ndarray, time=None, updated_attributes: "dict[str, TensorType]" = {}):
def step(self, state: np.ndarray, time=None, updated_attributes: "dict[str, TensorType]" = {}):

# The controller has to adapt when environment-related attributes such as target positions change
# Updated targets etc. are passed as a dictionary updated_attributes
self.update_attributes(updated_attributes) # After this call, updated attributes are available as self.<<attribute_name>>
update_attributes(updated_attributes,self) # After this call, updated attributes are available as self.<<attribute_name>>

# TODO: Implement your controller here
# Examples:
Expand Down
16 changes: 8 additions & 8 deletions Control_Toolkit_ASF_Template/Controllers/controller_do_mpc.py
Original file line number Diff line number Diff line change
Expand Up @@ -9,7 +9,7 @@
from CartPole.cartpole_model import Q2u, cartpole_ode_namespace
from CartPole.state_utilities import cartpole_state_vector_to_namespace
from Control_Toolkit.Controllers import template_controller
from others.globals_and_utils import create_rng
from others.globals_and_utils import create_rng, update_attributes
from SI_Toolkit.computation_library import NumpyLibrary, TensorType


Expand Down Expand Up @@ -136,16 +136,16 @@ def tvp_fun(self, t_ind):
return self.tvp_template


def step(self, s: np.ndarray, time=None, updated_attributes: "dict[str, TensorType]" = {}):
self.update_attributes(updated_attributes)
def step(self, state: np.ndarray, time=None, updated_attributes: "dict[str, TensorType]" = {}):
update_attributes(updated_attributes,self)

s = cartpole_state_vector_to_namespace(s)
state = cartpole_state_vector_to_namespace(state)

self.x0['s.position'] = s.position
self.x0['s.positionD'] = s.positionD
self.x0['s.position'] = state.position
self.x0['s.positionD'] = state.positionD

self.x0['s.angle'] = s.angle
self.x0['s.angleD'] = s.angleD
self.x0['s.angle'] = state.angle
self.x0['s.angleD'] = state.angleD

self.tvp_template['_tvp', :, 'target_position'] = self.target_position

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -11,6 +11,7 @@
from CartPole.state_utilities import cartpole_state_vector_to_namespace
from Control_Toolkit.Controllers import template_controller
from SI_Toolkit.computation_library import NumpyLibrary, TensorType
from others.globals_and_utils import update_attributes


def mpc_next_state(s, u, dt):
Expand Down Expand Up @@ -147,16 +148,16 @@ def configure(self):
def tvp_fun(self, t_ind):
return self.tvp_template

def step(self, s: np.ndarray, time=None, updated_attributes: "dict[str, TensorType]" = {}):
self.update_attributes(updated_attributes)
def step(self, state: np.ndarray, time=None, updated_attributes: "dict[str, TensorType]" = {}):
update_attributes(updated_attributes,self)

s = cartpole_state_vector_to_namespace(s)
state = cartpole_state_vector_to_namespace(state)

self.x0['s.position'] = s.position
self.x0['s.positionD'] = s.positionD
self.x0['s.position'] = state.position
self.x0['s.positionD'] = state.positionD

self.x0['s.angle'] = s.angle
self.x0['s.angleD'] = s.angleD
self.x0['s.angle'] = state.angle
self.x0['s.angleD'] = state.angleD

self.tvp_template['_tvp', :, 'target_position'] = self.target_position

Expand Down
8 changes: 4 additions & 4 deletions Control_Toolkit_ASF_Template/Controllers/controller_lqr.py
Original file line number Diff line number Diff line change
Expand Up @@ -14,7 +14,7 @@
from CartPole.state_utilities import (ANGLE_IDX, ANGLED_IDX, POSITION_IDX,
POSITIOND_IDX)
from Control_Toolkit.Controllers import template_controller
from others.globals_and_utils import create_rng
from others.globals_and_utils import create_rng, update_attributes

config = yaml.load(open("config.yml", "r"), Loader=yaml.FullLoader)
actuator_noise = config["cartpole"]["actuator_noise"]
Expand Down Expand Up @@ -80,11 +80,11 @@ def configure(self):
self.X = X
self.eigVals = eigVals

def step(self, s: np.ndarray, time=None, updated_attributes: "dict[str, TensorType]" = {}):
self.update_attributes(updated_attributes)
def step(self, state: np.ndarray, time=None, updated_attributes: "dict[str, TensorType]" = {}):
update_attributes(updated_attributes,self)

state = np.array(
[[s[POSITION_IDX] - self.target_position], [s[POSITIOND_IDX]], [s[ANGLE_IDX]], [s[ANGLED_IDX]]])
[[state[POSITION_IDX] - self.target_position], [state[POSITIOND_IDX]], [state[ANGLE_IDX]], [state[ANGLED_IDX]]])

Q = np.dot(-self.K, state).item()

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -9,7 +9,7 @@


# TODO: Load constants from the cost config file, like this:
config = yaml.load(open(os.path.join("Control_Toolkit_ASF", "config_cost_function.yml"), "r"), Loader=yaml.FullLoader)
config = yaml.load(open(os.path.join("Control_Toolkit_ASF", "config_cost_functions.yml"), "r"), Loader=yaml.FullLoader)

# TODO: Rename parent folder from EnvironmentName to the actual name of you environment
# TODO: Load constants like this:
Expand All @@ -23,7 +23,7 @@
class cost_function_barebone(cost_function_base):
"""This class can contain arbitrary helper functions to compute the cost of a trajectory or inputs."""
MAX_COST = 0.0 # Define maximum value the cost can take. Used for shifting

# Example: Cost for difference from upright position
# def _E_pot_cost(self, angle):
# """Compute penalty for not balancing pole upright (penalize large angles)"""
Expand All @@ -39,8 +39,8 @@ def get_terminal_cost(self, terminal_states: TensorType):
# return terminal_cost
pass

# all stage costs together
def _get_stage_cost(self, states: TensorType, inputs: TensorType, previous_input: TensorType):
# all stage costs together. A 'stage' is one timestep of a rollout.
def get_stage_cost(self, states: TensorType, inputs: TensorType, previous_input: TensorType):
# Shape of states: [batch_size, mpc_horizon, num_states]
# TODO: Compute stage cost
# return stage_cost
Expand Down
2 changes: 0 additions & 2 deletions Control_Toolkit_ASF_Template/config_controllers.yml
Original file line number Diff line number Diff line change
Expand Up @@ -11,7 +11,6 @@ mpc:
controller_logging: true
do-mpc-discrete:
mpc_horizon: 50 # steps
num_rollouts: 1
# Initial positions
position_init: 0.0
positionD_init: 0.0
Expand All @@ -21,7 +20,6 @@ do-mpc-discrete:
do-mpc:
seed: null # If null, random seed based on datetime is used
mpc_horizon: 50 # steps
num_rollouts: 1
p_Q: 0.00 # Perturbation factors: Change of output from optimal
# Random change of cost function by factor
p_position: 0.0
Expand Down
103 changes: 71 additions & 32 deletions Controllers/__init__.py
Original file line number Diff line number Diff line change
@@ -1,20 +1,23 @@
import os
from abc import ABC, abstractmethod
from typing import Tuple
from pathlib import Path
from typing import Tuple, Union

import numpy as np
import yaml
from Control_Toolkit.others.globals_and_utils import get_logger
from SI_Toolkit.computation_library import (ComputationLibrary, NumpyLibrary,
PyTorchLibrary, TensorFlowLibrary,
TensorType)
from others.globals_and_utils import load_or_reload_config_if_modified

config_cost_function = yaml.load(open(os.path.join("Control_Toolkit_ASF", "config_cost_function.yml")), Loader=yaml.FullLoader)
logger = get_logger(__name__)
from Control_Toolkit.others.get_logger import get_logger
log = get_logger(__name__)

config_cost_function = yaml.load(open(os.path.join("Control_Toolkit_ASF", "config_cost_functions.yml")), Loader=yaml.FullLoader)

"""
For a controller to be found and imported by CartPoleGUI/DataGenerator it must:
1. Be in Controller folder
1. Be in Controllers folder
2. Have a name starting with "controller_"
3. The name of the controller class must be the same as the name of the file.
4. It must have __init__ and step methods
Expand All @@ -26,7 +29,7 @@
class template_controller(ABC):
_has_optimizer = False
# Define the computation library in your controller class or in the controller's configuration:
_computation_library: "type[ComputationLibrary]" = None
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Technically, we always work with a static reference to the computation library. That means we don't create instances of it, but always call the methods on the class itself. Therefore, the type of this attribute is type. This is really only for documentation. The code will work no matter what we put here.

Btw, the subscript type notation "type[ComputationLibrary]" in quotes is necessary for python<=3.8. In later versions one can use type[ComputationLibrary], but we maintain backward compatibility for now

_computation_library: ComputationLibrary = None

def __init__(
self,
Expand All @@ -38,22 +41,25 @@ def __init__(
initial_environment_attributes: "dict[str, TensorType]",
):
# Load controller config and select the entry for the current controller
config_controllers = yaml.load(
open(os.path.join("Control_Toolkit_ASF", "config_controllers.yml")),
Loader=yaml.FullLoader
)
(config_controllers,_) = load_or_reload_config_if_modified(os.path.join("Control_Toolkit_ASF", "config_controllers.yml")) # ignore the _ changes return since this is initial call
f=os.path.join("Control_Toolkit_ASF", "config_controllers.yml")
fp=Path(f)
log.debug(f'loading controller config from "{fp.absolute()}"')
config_controllers = yaml.load(open(f), Loader=yaml.FullLoader)
# self.controller_name is inferred from the class name, which is the class being instantiated
# Example: If you create a controller_mpc, this controller_template.__init__ will be called
# but the class name will be controller_mpc, not template_controller.
self.config_controller = dict(config_controllers[self.controller_name])
config_key=self.controller_name
self.config_controller = config_controllers[config_key]
# add timestep .dt to all controllers here
self.config_controller["dt"] = dt

# Set computation library
computation_library_name = str(self.config_controller.get("computation_library", ""))

if computation_library_name:
# Assign computation library from config
logger.info(f"Found library {computation_library_name} for MPC controller.")
log.info(f"Found library {computation_library_name} for MPC controller.")
if "tensorflow" in computation_library_name.lower():
self._computation_library = TensorFlowLibrary
elif "pytorch" in computation_library_name.lower():
Expand All @@ -67,7 +73,7 @@ def __init__(
if not issubclass(self.computation_library, ComputationLibrary):
raise ValueError(f"{self.__class__.__name__} does not have a default computation library set. You have to define one in this controller's config.")
else:
logger.info(f"No computation library specified in controller config. Using default {self.computation_library} for class.")
log.info(f"No computation library specified in controller config. Using default {self.computation_library} for class.")
self.lib = self.computation_library # Shortcut to make easy using functions from computation library, this is also used by CompileAdaptive to recognize library

# Environment-related parameters
Expand All @@ -78,15 +84,20 @@ def __init__(
self.num_control_inputs = num_control_inputs
self.control_limits = control_limits
self.action_low, self.action_high = self.control_limits


# todo these are special for cartpole but we would need a base cartpole controller class to put them there
# self.target_position=None
# self.target_equilibrium=None

# Set properties like target positions on this controller
for p, v in initial_environment_attributes.items():
if type(v) in {np.ndarray, float, int, bool}:
data_type = getattr(v, "dtype", self.lib.float32)
data_type = self.lib.int32 if data_type == int else self.lib.float32
v = self.lib.to_variable(v, data_type)
setattr(self, p, v)

for property, new_value in initial_environment_attributes.items():
setattr(self, property, self.computation_library.to_variable(new_value, self.computation_library.float32))

# set all controller config numerical values as float variables in the computation space, e.g. tensorflow, so they can be updqted during runtime
for property, value in self.config_controller.items():
if value is float or value is int:
setattr(self, property, self.computation_library.to_variable(value, self.computation_library.float32))

# Initialize control variable
self.u = 0.0

Expand All @@ -106,21 +117,30 @@ def __init__(
def configure(self, **kwargs):
# In your controller, implement any additional initialization steps here
pass
def update_attributes(self, updated_attributes: "dict[str, TensorType]"):

def update_attributes(self, updated_attributes: "dict[str,TensorType]"):
for property, new_value in updated_attributes.items():
try:
# Assume the variable is an attribute type and assign
attr = getattr(self, property)
attr = getattr(self.variable_parameters, property)
self.computation_library.assign(attr, self.lib.to_tensor(new_value, attr.dtype))
except:
setattr(self, property, new_value)
setattr(self.variable_parameters, property, new_value)

@abstractmethod
def step(self, s: np.ndarray, time=None, updated_attributes: "dict[str, TensorType]" = {}):
def step(self, state: np.ndarray, time:float=None, updated_attributes: "dict[str, Union[TensorType,float]]" = dict()):
"""
Execute one timestep of control.

:param state: the state array, dimensions are TODO add dimension to help users
:param time: the time now in seconds
:param updated_attributes: dict of updated attributes

:returns: the next control action u e.g. a normed control input in the range [-1,1] TODO is this correct?

"""
### Any computations in order to retrieve the current control. Such as:
## If the environment's target positions etc. change, copy the new attributes over to this controller so the cost function knows about it:
# self.update_attributes(updated_attributes)
# update_attributes(updated_attributes,self)
## Use some sort of optimization procedure to get your control, e.g.
# u = self.optimizer.step(s, time)
## Use the following call to populate the self.logs dictionary with savevars, such as:
Expand All @@ -132,7 +152,7 @@ def step(self, s: np.ndarray, time=None, updated_attributes: "dict[str, TensorTy
# Optionally: A method called after an experiment.
# May be used to print some statistics about controller performance (e.g. number of iter. to converge)
def controller_report(self):
logger.info("No controller report implemented for this controller. Stopping without report.")
log.info("No controller report implemented for this controller. Stopping without report.")
pass

# Optionally: reset the controller after an experiment
Expand All @@ -143,10 +163,17 @@ def controller_report(self):
def controller_reset(self):
raise NotImplementedError

@property
@property # decorates the controller so it has the field .controller_name that gets its short name which is the key in the .yml config file
def controller_name(self):
""" Generates standard name for this controller, but use this method like it were a field.

:returns: the short name which is the key to the controller in the config_controller.yml file, e.g. 'cartpole_mppi'

"""
name = self.__class__.__name__
if name != "template_controller":
if 'controller_' not in name:
raise AttributeError(f'this controller named "{name}" does not contain "controller_". Controllers should start or contain "controller_" and the key in the config_controllers.yml file should follow the underscore')
return name.replace("controller_", "").replace("_", "-").lower()
else:
raise AttributeError()
Expand All @@ -156,7 +183,7 @@ def controller_data_for_csv(self):
return {}

@property
def computation_library(self) -> "type[ComputationLibrary]":
def computation_library(self) -> ComputationLibrary:
if self._computation_library == None:
raise NotImplementedError("Controller class needs to specify its computation library")
return self._computation_library
Expand All @@ -177,6 +204,8 @@ def get_outputs(self) -> "dict[str, np.ndarray]":
}

def update_logs(self, logging_values: "dict[str, TensorType]") -> None:
""" Appends controller logging information to memory in self.logs if self.controller_logging exists, according to self.save_vars
"""
if self.controller_logging:
for name, var in zip(
self.save_vars, [logging_values.get(var_name, None) for var_name in self.save_vars]
Expand All @@ -185,3 +214,13 @@ def update_logs(self, logging_values: "dict[str, TensorType]") -> None:
self.logs[name].append(
var.numpy().copy() if hasattr(var, "numpy") else var.copy()
)

def keyboard_input(self, c):
""" process keyboard input character. Default implementation does nothing.
:param c: character
"""
pass

def print_keyboard_help(self):
""" Print help for keybaord input"""
pass
Loading