Merge pull request #74 from hadar-simulator/release/v0.3.0
Release/v0.3.0
FrancoisJ authored Jun 24, 2020
2 parents fd2173e + b6d443d commit 6d8ce72
Showing 56 changed files with 2,688 additions and 286 deletions.
3 changes: 2 additions & 1 deletion .github/workflows/prod.yml
@@ -43,8 +43,9 @@ jobs:
run: |
git lfs pull
pip install -i https://test.pypi.org/simple/ hadar
pip install jupyter click
cd examples
python3 check.py
python3 utils.py --src=./ --check=./
- name: Release pypi.org
run: |
3 changes: 2 additions & 1 deletion .github/workflows/staging.yml
@@ -34,8 +34,9 @@ jobs:
run: |
git lfs pull
export PYTHONPATH=$(pwd)
pip install click jupyter
cd examples
python3 check.py
python3 utils.py --src=./ --check=./
- name: Release test.pypi.org
run: |
11 changes: 11 additions & 0 deletions CODE_OF_CONDUCT.md
@@ -0,0 +1,11 @@
# Contributor Code of Conduct

As contributors and maintainers of the Hadar project, we pledge to respect everyone who contributes by posting issues, updating documentation, submitting pull requests, providing feedback in comments, and any other activities.

Communication through any of Hadar's channels (TODO) must be constructive and never resort to personal attacks, trolling, public or private harassment, insults, or other unprofessional conduct.

We promise to extend courtesy and respect to everyone involved in this project regardless of gender, gender identity, sexual orientation, disability, age, race, ethnicity, religion, or level of experience. We expect anyone contributing to the Hadar project to do the same.

If any member of the community violates this code of conduct, the maintainers of the Hadar project may take action, removing issues, comments, and PRs or blocking accounts as deemed appropriate.

If you are subject to or witness unacceptable behavior, or have any other concerns, please email us at [contact@hadar-simulator.org](mailto:contact@hadar-simulator.org).
4 changes: 4 additions & 0 deletions CONTRIBUTING.md
@@ -0,0 +1,4 @@
# Contributing
We appreciate your help.

Before filing an issue, please read the [Contribution Guide](https://doc.hadar-simulator.org/en/develop/dev-guide/devops.html).
Binary file removed docs/source/_static/get-started-1.png
Binary file not shown.
Binary file removed docs/source/_static/get-started-2.png
Binary file not shown.
Binary file removed docs/source/_static/get-started-3.png
Binary file not shown.
195 changes: 195 additions & 0 deletions docs/source/architecture/analyzer.rst
@@ -0,0 +1,195 @@
Analyzer
========

To stay at a high level of abstraction and remain technology-agnostic, Hadar uses objects as the glue around the optimizer. Objects are convenient, but they are too cumbersome to manipulate for data analysis. The analyzer module contains tools that help analyze study results.

Today, there is only :code:`ResultAnalyzer`, which offers two feature levels:

* **high level**: the user directly asks for computed values such as the global cost, the global remaining capacity, etc.

* **low level**: the user asks for *raw* data represented inside a pandas DataFrame.

Before describing these features, let's see how the data are transformed.

Flatten Data
------------

As said above, objects are nice for encapsulating data and representing it in an agnostic form. They can be serialized into JSON or another format to be used by other software, perhaps written in another language. But keeping objects around for data analysis is painful.

Python has a very efficient tool for data analysis: pandas. The challenge is therefore to transform objects into pandas DataFrames. The solution used is to flatten the data into a table.

Consumption
***********

Take consumption as an example. The data in :code:`Study` are the cost and the asked quantity; in :code:`Result` they are the cost (the same) and the given quantity. This tuple *(cost, asked, given)* exists for each node, each consumption attached to that node, each scenario and each timestep. To flatten the data, we need to fill the following table

+------+------+------+------+------+------+------+
| cost | asked| given| node | name | scn | t |
+------+------+------+------+------+------+------+
| 10 | 5 | 5 | fr | load | 0 | 0 |
+------+------+------+------+------+------+------+
| 10 | 7 | 7 | fr | load | 0 | 1 |
+------+------+------+------+------+------+------+
| 10 | 7 | 5 | fr | load | 1 | 0 |
+------+------+------+------+------+------+------+
| 10 | 6 | 6 | fr | load | 1 | 1 |
+------+------+------+------+------+------+------+
| ... | ... | ... | ... | ... | .. | ... |
+------+------+------+------+------+------+------+

Building this table is the purpose of :code:`_build_consumption(study: Study, result: Result) -> pd.Dataframe`.

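As an illustration only, here is a simplified sketch of the flattening idea; it is not the actual Hadar implementation, and the nested :code:`Study` / :code:`Result` objects are replaced by plain dicts. ::

    import pandas as pd

    # Hypothetical nested data (illustration only): one entry per node and per
    # consumption name, with one asked/given value per scenario and per timestep.
    study = {'fr': {'load': {'cost': 10, 'asked': [[5, 7], [7, 6]]}}}
    result = {'fr': {'load': {'given': [[5, 7], [5, 6]]}}}

    rows = []
    for node, consumptions in study.items():
        for name, cons in consumptions.items():
            given = result[node][name]['given']
            for scn, asked_by_t in enumerate(cons['asked']):
                for t, asked in enumerate(asked_by_t):
                    rows.append({'cost': cons['cost'], 'asked': asked,
                                 'given': given[scn][t], 'node': node,
                                 'name': name, 'scn': scn, 't': t})

    df = pd.DataFrame(rows)  # one row per (node, name, scn, t)
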
Production
**********

Productions follow the same pattern. However, they don't have *asked* and *given* quantities but *available* and *used* quantities. The table therefore looks like

+------+------+------+------+------+------+------+
| cost | avail| used | node | name | scn | t |
+------+------+------+------+------+------+------+
| 10 | 100 | 21 | fr | coal | 0 | 0 |
+------+------+------+------+------+------+------+
| 10 | 100 | 36 | fr | coal | 0 | 1 |
+------+------+------+------+------+------+------+
| 10 | 100 | 12 | fr | coal | 1 | 0 |
+------+------+------+------+------+------+------+
| 10 | 100 | 81 | fr | coal | 1 | 1 |
+------+------+------+------+------+------+------+
| ... | ... | ... | ... | ... | .. | ... |
+------+------+------+------+------+------+------+

This is done by the :code:`_build_production(study: Study, result: Result) -> pd.Dataframe` method.


Link
****

Links follow the same pattern, but the naming of the hierarchical structure changes: there is no *node* and *name*, but a *source* and a *destination*. The table therefore looks like

+------+------+------+------+------+------+------+
| cost | avail| used | src | dest | scn | t |
+------+------+------+------+------+------+------+
| 10 | 100 | 21 | fr | uk | 0 | 0 |
+------+------+------+------+------+------+------+
| 10 | 100 | 36 | fr | uk | 0 | 1 |
+------+------+------+------+------+------+------+
| 10 | 100 | 12 | fr | uk | 1 | 0 |
+------+------+------+------+------+------+------+
| 10 | 100 | 81 | fr | uk | 1 | 1 |
+------+------+------+------+------+------+------+
| ... | ... | ... | ... | ... | .. | ... |
+------+------+------+------+------+------+------+

This is done by the :code:`_build_link(study: Study, result: Result) -> pd.Dataframe` method.


Low level analysis
------------------

When you look at the flat data, there are two kinds of columns: *content*, such as cost, given and asked, and *index*, described by node, name, scn and t.

The low-level analysis API provided by :code:`ResultAnalyzer` lets the user

#. organize the index levels, for example time, then scenario, then name, then node;

#. filter on the index, for example only timesteps 10 to 150, only the 'fr' node, etc.

A user can ask: *I want the productions of the 'fr' node, for the first scenario, from timestep 50 to 60.* In this case :code:`ResultAnalyzer` will return

+------+------+------+------+------+
|      |      | used | cost | avail|
+------+------+------+------+------+
| t    | name |      |      |      |
+------+------+------+------+------+
| 50   | oil  | 36   | 10   | 100  |
+      +------+------+------+------+
|      | coal | 12   | 10   | 100  |
+------+------+------+------+------+
| 60   | oil  | 81   | 10   | 100  |
+      +------+------+------+------+
| ...  | ...  | ...  | ...  | ...  |
+------+------+------+------+------+

If the first indexes, such as node and scenario here, contain only one element, they are removed.

This result can be obtained with the following code. ::

    agg = hd.ResultAnalyzer(study, result)
    df = agg.agg_prod(agg.inode['fr'], agg.scn[0], agg.itime[50:60], agg.iname)

As you can see, the user selects the index hierarchy by ordering the :code:`agg.ixxx` arguments, then specifies filters with :code:`agg.ixxx[yy]`.

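Swapping the order of the indexes changes the hierarchy of the returned frame. For instance, with the same :code:`agg` as above, grouping by name first and then by time would be written ::

    df = agg.agg_prod(agg.inode['fr'], agg.scn[0], agg.iname, agg.itime[50:60])
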
Behind this mechanism are :code:`Index` objects, as you can see directly in the code ::

    @property
    def inode(self) -> NodeIndex:
        """
        Get a node index to specify node slice to aggregate consumption or production.

        :return: new instance of NodeIndex()
        """
        return NodeIndex()


Each kind of index has to inherit from this class. An :code:`Index` object encapsulates the column metadata to use and the range of filtered elements to keep (set through the :code:`__getitem__` method). Hadar then provides child classes with the right parameters: :code:`NameIndex`, :code:`NodeIndex`, :code:`ScnIndex`, :code:`TimeIndex`, :code:`SrcIndex`, :code:`DestIndex`. For example, you can find the :code:`NodeIndex` implementation below ::

    class NodeIndex(Index[str]):
        """Index implementation to filter nodes"""
        def __init__(self):
            Index.__init__(self, column='node')

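For intuition, a minimal version of the base class could look like the sketch below (a simplification, not the exact Hadar code) ::

    from typing import Generic, TypeVar

    T = TypeVar('T')

    class Index(Generic[T]):
        """Encapsulate the column to filter on and the elements to keep."""

        def __init__(self, column: str):
            self.column = column   # DataFrame column this index works on
            self.elements = None   # None means 'keep every element'

        def __getitem__(self, items):
            # called when the user writes agg.ixxx[yy]
            self.elements = items
            return self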

.. image:: /_static/architecture/analyzer/ulm-index.png

Index instantiation is completely hidden from the user; an index is created implicitly when the user types :code:`agg.ixxx[yy]`. Hadar will then (a rough sketch of these steps follows the list below)

#. check that the mandatory indexes are given, with the :code:`_assert_index` method;

#. pivot the table to recreate the indexing according to the requested filters and ordering, with the :code:`_pivot` method;

#. remove one-element top-level indexes, with the :code:`_remove_useless_index_level` method.

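A rough, standalone illustration of these three steps with plain pandas (a simplified sketch, not Hadar's actual code) ::

    import pandas as pd

    flat = pd.DataFrame({'node': ['fr', 'fr', 'fr', 'fr'],
                         'name': ['oil', 'coal', 'oil', 'coal'],
                         'scn':  [0, 0, 0, 0],
                         't':    [50, 50, 60, 60],
                         'used': [36, 12, 81, 25]})

    # 1. here the mandatory index columns (node, name, scn, t) are all present

    # 2. pivot: index the frame in the order asked by the user, then filter it
    pivot = flat.set_index(['node', 'scn', 't', 'name']).sort_index()
    pivot = pivot.loc[pd.IndexSlice['fr', 0, 50:60, :], :]

    # 3. drop top-level indexes that carry a single value ('fr' and 0 here)
    for level in ['node', 'scn']:
        if pivot.index.get_level_values(level).nunique() == 1:
            pivot = pivot.droplevel(level)
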
As you can see, the low-level analysis provides efficient methods to extract data from an adequacy study result. However, the returned data remain rather *raw* and are not ready for business purposes.

High Level Analysis
-------------------

Unlike the low level, the high level focuses on providing ready-to-use data, and its features are designed one by one for a specific business purpose. Today there are two features, illustrated below:

* :code:`get_cost(self, node: str) -> np.ndarray`: returns, for the given node, a matrix of shape (scenario, horizon) with the summarized cost.

* :code:`get_balance(self, node: str) -> np.ndarray`: returns, for the given node, a matrix of shape (scenario, horizon) with the exchange balance (i.e. the sum of exports minus the sum of imports).

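For example, reusing the :code:`agg` analyzer built above ::

    cost = agg.get_cost('fr')        # matrix of shape (scenario, horizon)
    balance = agg.get_balance('fr')  # positive when 'fr' exports more than it imports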