feat: switch from `edi-energy.de` and scraping to `bdew-mako.de` and a real API; drop support for Python 3.9 #261

hf-kklein · 2025-01-08T16:32:44Z

Please run mwe.py once and convince yourselves that it works well :)

Please pay particular attention to the file names and check whether, from your point of view, they contain everything essential, e.g. so that @hf-krechan, @DeltaDaniel and @OLILHR can derive from them which document is the right one for kohlrahbi, ahlbatross etc.

because there's a real API now :)

for the api models

hf-kklein · 2025-01-08T16:33:05Z

.gitignore

@@ -2,7 +2,7 @@
 __pycache__/
 *.py[cod]
 *$py.class
-
+foo


see the mwe.py

hf-kklein · 2025-01-09T10:01:21Z

src/edi_energy_scraper/apidocument.py

+        placeholder_values = {
+            "publication_date": (self.publicationDate or self.gueltig_ab).strftime("%Y%m%d"),
+            "from_date": self.gueltig_ab.strftime("%Y%m%d"),
+            "to_date": self.gueltig_bis.strftime("%Y%m%d"),
+            "extension": self.file_extension,
+            "id": str(self.fileId),
+            "kind": self.file_kind or self.alternative_file_kind,
+            "edifact_format": (self.edifact_format + "_" if self.edifact_format else ""),
+            "version": self.document_version or "NV",
+        }
+        return "{kind}_{edifact_format}{version}_{to_date}_{from_date}_{publication_date}_{id}.{extension}".format(


Here you are free to define what we put into the file name.
My inner conflict is:

either we overload the file name with everything we might need

or we store the (prepared) metadata in an extra file next to the PDFs and DOCX files and let the "clients" read that metadata separately.
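To make the second option more concrete, here is a minimal sketch of a metadata file stored next to each document. Everything in it is an assumption for illustration: the `DocumentMetadata` class, its field names, and the `.meta.json` suffix are hypothetical and not part of this PR.

```python
import json
from dataclasses import asdict, dataclass
from pathlib import Path
from typing import Optional


@dataclass
class DocumentMetadata:
    """Hypothetical subset of the prepared metadata (field names are assumptions)."""

    file_id: int
    kind: str
    edifact_format: Optional[str]
    version: Optional[str]
    valid_from: str  # formatted as YYYYMMDD
    valid_to: str  # formatted as YYYYMMDD
    publication_date: str  # formatted as YYYYMMDD


def sidecar_path(document_path: Path) -> Path:
    """Return the path of the metadata file that sits next to the document."""
    # Hypothetical convention: "<document name>.meta.json"
    return document_path.with_name(document_path.name + ".meta.json")


def to_sidecar_text(metadata: DocumentMetadata) -> str:
    """Serialize the metadata as JSON with a stable key order."""
    return json.dumps(asdict(metadata), indent=2, sort_keys=True)


def from_sidecar_text(text: str) -> DocumentMetadata:
    """Rebuild the metadata object from the JSON text a client has read."""
    return DocumentMetadata(**json.loads(text))
```

With this layout the file name could stay short (for example just the ID), and a client would read `sidecar_path(document).read_text(encoding="utf-8")` and pass it to `from_sidecar_text`. The cost is that every client has to keep two files together instead of one.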

hf-kklein · 2025-01-09T10:02:02Z

src/edi_energy_scraper/apidocument.py

+            "edifact_format": (self.edifact_format + "_" if self.edifact_format else ""),
+            "version": self.document_version or "NV",
+        }
+        return "{kind}_{edifact_format}{version}_{to_date}_{from_date}_{publication_date}_{id}.{extension}".format(


You can find example file names in the snapshot: https://github.com/Hochfrequenz/edi_energy_scraper/blob/663f891e24545d1d556f9a25a0b200ffa562dc9d/unittests/__snapshots__/test_models.ambr
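For reviewers judging whether the file name carries enough information, here is a rough sketch of how a client might read the trailing fields back out of a name built from the template above. `parse_file_name` and `ParsedFileName` are hypothetical helpers, and the sample name in the usage note is made up, not taken from the snapshot. The sketch assumes the ID and the three dates never contain an underscore. It deliberately leaves the leading `{kind}_{edifact_format}{version}` part unsplit, because it cannot be separated unambiguously without knowing which values may themselves contain underscores.

```python
from datetime import date, datetime
from typing import NamedTuple


class ParsedFileName(NamedTuple):
    """Fields recovered from a file name (hypothetical helper type)."""

    head: str  # unsplit "{kind}_{edifact_format}{version}" part
    to_date: date
    from_date: date
    publication_date: date
    file_id: str
    extension: str


def _parse_date(text: str) -> date:
    """Parse a date in the YYYYMMDD format used by the template."""
    return datetime.strptime(text, "%Y%m%d").date()


def parse_file_name(file_name: str) -> ParsedFileName:
    """Split a file name from the right, where the fields have a fixed shape."""
    stem, extension = file_name.rsplit(".", 1)
    # The last four underscore-separated fields are to_date, from_date,
    # publication_date and id; everything before them is kept as one string.
    head, to_date, from_date, publication_date, file_id = stem.rsplit("_", 4)
    return ParsedFileName(
        head=head,
        to_date=_parse_date(to_date),
        from_date=_parse_date(from_date),
        publication_date=_parse_date(publication_date),
        file_id=file_id,
        extension=extension,
    )
```

For the made-up name `AHB_UTILMD_2.1_20250605_20241001_20240930_1234.pdf` this yields the head `AHB_UTILMD_2.1`, the three dates, the ID `1234` and the extension `pdf`. A name with fewer fields raises a `ValueError`, which is one way a client could notice that the naming scheme has changed.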

Konstantin added 9 commits January 1, 2025 14:08

Merge remote-tracking branch 'origin/main' into bdew-mako-models

ceec9f8

➖ drop beautifoul soup

2534fac

because there's a real API now :)

➕ add pydantic

56e6fd4

for the api models

restructuring WIP

4a96eaf

wip

281403c

wip

4e77746

Merge remote-tracking branch 'origin/main' into bdew-mako-models

083bf4a

wip

09dbc21

better

49a3fd6


hf-kklein requested review from lord-haffi, hf-krechan, DeltaDaniel and OLILHR January 8, 2025 16:34

Konstantin added 3 commits January 8, 2025 17:36

spell checks and documentation

6ded9ee

shame shame shame

c5d3c4e

readability

4e228ba

hf-kklein self-assigned this Jan 8, 2025

drop 3.9

663f891

hf-kklein changed the title ~~feat: switch from edi-energy.de and scraping to bdew-mako.de and a real API~~ feat: switch from edi-energy.de and scraping to bdew-mako.de and a real API; drop support for Python 3.9 Jan 8, 2025
