diff --git a/.gitignore b/.gitignore new file mode 100644 index 0000000..98c8b02 --- /dev/null +++ b/.gitignore @@ -0,0 +1,3 @@ +venv +zz +*# \ No newline at end of file diff --git a/csv/2024-2028/Fachsystematik_2024-2028.csv b/csv/2024-2028/Fachsystematik_2024-2028.csv index a6b3334..5d81e8a 100644 --- a/csv/2024-2028/Fachsystematik_2024-2028.csv +++ b/csv/2024-2028/Fachsystematik_2024-2028.csv @@ -1,174 +1,203 @@ Subject Number,Subject,Review Board,Subject Area,Scientific Discipline,Fachnummer,Fach,Fachkollegium,Fachgebiet,Wissenschaftsbereich 1.11-01,Prehistory and World Archaeology,"1.11 -Ancient Cultures",11 Humanities,"1 +Ancient Cultures","11 +Humanities","1 Humanities and Social Sciences",1.11-01,Ur- und Frühgeschichte (weltweit),"1.11 Alte Kulturen","11 Geisteswissenschaften","1 Geistes- und Sozialwissenschaften" 1.11-02,Greek and Latin Philology,"1.11 -Ancient Cultures",11 Humanities,"1 +Ancient Cultures","11 +Humanities","1 Humanities and Social Sciences",1.11-02,Griechische und Lateinische Philologie,"1.11 Alte Kulturen","11 Geisteswissenschaften","1 Geistes- und Sozialwissenschaften" 1.11-03,Ancient History,"1.11 -Ancient Cultures",11 Humanities,"1 +Ancient Cultures","11 +Humanities","1 Humanities and Social Sciences",1.11-03,Alte Geschichte,"1.11 Alte Kulturen","11 Geisteswissenschaften","1 Geistes- und Sozialwissenschaften" 1.11-04,"Classical, Roman, Christian and Islamic Archaeology","1.11 -Ancient Cultures",11 Humanities,"1 +Ancient Cultures","11 +Humanities","1 Humanities and Social Sciences",1.11-04,"Klassische, Provinzialrömische, Christliche und Islamische Archäologie","1.11 Alte Kulturen","11 Geisteswissenschaften","1 Geistes- und Sozialwissenschaften" 1.11-05,Egyptology and Ancient Near Eastern Studies,"1.11 -Ancient Cultures",11 Humanities,"1 +Ancient Cultures","11 +Humanities","1 Humanities and Social Sciences",1.11-05,Ägyptische und Vorderasiatische Altertumswissenschaften,"1.11 Alte Kulturen","11 Geisteswissenschaften","1 Geistes- und Sozialwissenschaften" 1.12-01,Medieval History,"1.12 -History",11 Humanities,"1 +History","11 +Humanities","1 Humanities and Social Sciences",1.12-01,Mittelalterliche Geschichte,"1.12 Geschichtswissenschaften","11 Geisteswissenschaften","1 Geistes- und Sozialwissenschaften" 1.12-02,Early Modern History,"1.12 -History",11 Humanities,"1 +History","11 +Humanities","1 Humanities and Social Sciences",1.12-02,Frühneuzeitliche Geschichte,"1.12 Geschichtswissenschaften","11 Geisteswissenschaften","1 Geistes- und Sozialwissenschaften" 1.12-03,Modern and Contemporary History,"1.12 -History",11 Humanities,"1 +History","11 +Humanities","1 Humanities and Social Sciences",1.12-03,Neuere und Neueste Geschichte (einschl. Europäische Geschichte der Neuzeit und Außereuropäische Geschichte),"1.12 Geschichtswissenschaften","11 Geisteswissenschaften","1 Geistes- und Sozialwissenschaften" 1.12-04,History of Science,"1.12 -History",11 Humanities,"1 +History","11 +Humanities","1 Humanities and Social Sciences",1.12-04,Wissenschaftsgeschichte,"1.12 Geschichtswissenschaften","11 Geisteswissenschaften","1 Geistes- und Sozialwissenschaften" 1.13-01,Art History,"1.13 - Art History, Music, Theatre and Media Studies",11 Humanities,"1 + Art History, Music, Theatre and Media Studies","11 +Humanities","1 Humanities and Social Sciences",1.13-01,Kunstgeschichte,"1.13 Kunst-, Musik-, Theater- und Medienwissenschaften","11 Geisteswissenschaften","1 Geistes- und Sozialwissenschaften" 1.13-02,Musicology,"1.13 - Art History, Music, Theatre and Media Studies",11 Humanities,"1 + Art History, Music, Theatre and Media Studies","11 +Humanities","1 Humanities and Social Sciences",1.13-02,Musikwissenschaften,"1.13 Kunst-, Musik-, Theater- und Medienwissenschaften","11 Geisteswissenschaften","1 Geistes- und Sozialwissenschaften" 1.13-03,Theatre and Media Studies,"1.13 - Art History, Music, Theatre and Media Studies",11 Humanities,"1 + Art History, Music, Theatre and Media Studies","11 +Humanities","1 Humanities and Social Sciences",1.13-03,Theater- und Medienwissenschaften,"1.13 Kunst-, Musik-, Theater- und Medienwissenschaften","11 Geisteswissenschaften","1 Geistes- und Sozialwissenschaften" 1.14-01,"General and Comparative Linguistics, Experimental Linguistics, Typology, Non-European Languages","1.14 -Linguistics",11 Humanities,"1 +Linguistics","11 +Humanities","1 Humanities and Social Sciences",1.14-01,"Allgemeine und Vergleichende Sprachwissenschaft, Experimentelle Linguistik, Typologie, Außereuropäische Sprachen","1.14 Sprachwissenschaften","11 Geisteswissenschaften","1 Geistes- und Sozialwissenschaften" 1.14-02,"Individual Linguistics, Historical Linguistics","1.14 -Linguistics",11 Humanities,"1 +Linguistics","11 +Humanities","1 Humanities and Social Sciences",1.14-02,"Einzelsprachenwissenschaften, Historische Linguistik","1.14 Sprachwissenschaften","11 Geisteswissenschaften","1 Geistes- und Sozialwissenschaften" 1.14-03,"Applied Linguistics, Computational Linguistics","1.14 -Linguistics",11 Humanities,"1 +Linguistics","11 +Humanities","1 Humanities and Social Sciences",1.14-03,"Angewandte Sprachwissenschaften, Computerlinguistik","1.14 Sprachwissenschaften","11 Geisteswissenschaften","1 Geistes- und Sozialwissenschaften" 1.15-01,German Medieval Studies (Medieval German Literature),"1.15 -Literary Studies",11 Humanities,"1 +Literary Studies","11 +Humanities","1 Humanities and Social Sciences",1.15-01,Germanistische Mediävistik (Ältere deutsche Literatur),"1.15 Literaturwissenschaft","11 Geisteswissenschaften","1 Geistes- und Sozialwissenschaften" 1.15-02,German Literary and Cultural Studies (Modern German Literature),"1.15 -Literary Studies",11 Humanities,"1 +Literary Studies","11 +Humanities","1 Humanities and Social Sciences",1.15-02,Germanistische Literatur-und Kulturwissenschaften (Neuere deutsche Literatur),"1.15 Literaturwissenschaft","11 Geisteswissenschaften","1 Geistes- und Sozialwissenschaften" 1.15-03,European and American Literary and Cultural Studies,"1.15 -Literary Studies",11 Humanities,"1 +Literary Studies","11 +Humanities","1 Humanities and Social Sciences",1.15-03,Europäische und Amerikanische Literatur- und Kulturwissenschaften,"1.15 Literaturwissenschaft","11 Geisteswissenschaften","1 Geistes- und Sozialwissenschaften" 1.15-04,General and Comparative Literary Studies; Cultural Studies,"1.15 -Literary Studies",11 Humanities,"1 +Literary Studies","11 +Humanities","1 Humanities and Social Sciences",1.15-04,Allgemeine und vergleichende Literaturwissenschaft; Kulturwissenschaft,"1.15 Literaturwissenschaft","11 Geisteswissenschaften","1 Geistes- und Sozialwissenschaften" 1.16-01,Social and Cultural Anthropology and Ethnology ,"1.16 -Social and Cultural Anthropology, Non-European Cultures, Jewish Studies and Religious Studies",11 Humanities,"1 +Social and Cultural Anthropology, Non-European Cultures, Jewish Studies and Religious Studies","11 +Humanities","1 Humanities and Social Sciences",1.16-01,Ethnologie und Europäische Ethnologie,"1.16 Sozial- und Kulturanthropologie, Außereuropäische Kulturen, Judaistik und Religionswissenschaft","11 Geisteswissenschaften","1 Geistes- und Sozialwissenschaften" 1.16-02,Asian Studies,"1.16 -Social and Cultural Anthropology, Non-European Cultures, Jewish Studies and Religious Studies",11 Humanities,"1 +Social and Cultural Anthropology, Non-European Cultures, Jewish Studies and Religious Studies","11 +Humanities","1 Humanities and Social Sciences",1.16-02,Asienbezogene Wissenschaften,"1.16 Sozial- und Kulturanthropologie, Außereuropäische Kulturen, Judaistik und Religionswissenschaft","11 Geisteswissenschaften","1 Geistes- und Sozialwissenschaften" 1.16-03,"African, American and Oceania Studies ","1.16 -Social and Cultural Anthropology, Non-European Cultures, Jewish Studies and Religious Studies",11 Humanities,"1 +Social and Cultural Anthropology, Non-European Cultures, Jewish Studies and Religious Studies","11 +Humanities","1 Humanities and Social Sciences",1.16-03,"Afrika-, Amerika- und Ozeanienbezogene Wissenschaften","1.16 Sozial- und Kulturanthropologie, Außereuropäische Kulturen, Judaistik und Religionswissenschaft","11 Geisteswissenschaften","1 Geistes- und Sozialwissenschaften" 1.16-04,"Islamic Studies, Arabian Studies, Semitic Studies","1.16 -Social and Cultural Anthropology, Non-European Cultures, Jewish Studies and Religious Studies",11 Humanities,"1 +Social and Cultural Anthropology, Non-European Cultures, Jewish Studies and Religious Studies","11 +Humanities","1 Humanities and Social Sciences",1.16-04,"Islamwissenschaft, Arabistik, Semitistik","1.16 Sozial- und Kulturanthropologie, Außereuropäische Kulturen, Judaistik und Religionswissenschaft","11 Geisteswissenschaften","1 Geistes- und Sozialwissenschaften" 1.16-05,Religious Studies and Jewish Studies,"1.16 -Social and Cultural Anthropology, Non-European Cultures, Jewish Studies and Religious Studies",11 Humanities,"1 +Social and Cultural Anthropology, Non-European Cultures, Jewish Studies and Religious Studies","11 +Humanities","1 Humanities and Social Sciences",1.16-05,Religionswissenschaft und Judaistik,"1.16 Sozial- und Kulturanthropologie, Außereuropäische Kulturen, Judaistik und Religionswissenschaft","11 Geisteswissenschaften","1 Geistes- und Sozialwissenschaften" 1.17-01,Protestant Theology,"1.17 -Theology",11 Humanities,"1 +Theology","11 +Humanities","1 Humanities and Social Sciences",1.17-01,Evangelische Theologie,"1.17 Theologie","11 Geisteswissenschaften","1 Geistes- und Sozialwissenschaften" 1.17-02,Roman Catholic Theology,"1.17 -Theology",11 Humanities,"1 +Theology","11 +Humanities","1 Humanities and Social Sciences",1.17-02,Katholische Theologie,"1.17 Theologie","11 Geisteswissenschaften","1 Geistes- und Sozialwissenschaften" 1.18-01,History of Philosophy,"1.18 -Philosophy",11 Humanities,"1 +Philosophy","11 +Humanities","1 Humanities and Social Sciences",1.18-01,Geschichte der Philosophie,"1.18 Philosophie","11 Geisteswissenschaften","1 Geistes- und Sozialwissenschaften" 1.18-02,Theoretical Philosophy,"1.18 -Philosophy",11 Humanities,"1 +Philosophy","11 +Humanities","1 Humanities and Social Sciences",1.18-02,Theoretische Philosophie,"1.18 Philosophie","11 Geisteswissenschaften","1 Geistes- und Sozialwissenschaften" 1.18-03,Practical Philosophy,"1.18 -Philosophy",11 Humanities,"1 +Philosophy","11 +Humanities","1 Humanities and Social Sciences",1.18-03,Praktische Philosophie,"1.18 Philosophie","11 Geisteswissenschaften","1 diff --git a/csv_parsing_errors.md b/csv_parsing_errors.md new file mode 100644 index 0000000..bbddd7b --- /dev/null +++ b/csv_parsing_errors.md @@ -0,0 +1,33 @@ +# CSV Parsing Errors & Fixes + +## Error in `11 Humanities` + + +`python scripts/create_ontology.py csv/2024-2028/Fachsystematik_2024-2028.csv` + +```python +SECTION: 0 Scientific Discipline +INDEX: 0 COL:Scientific Discipline CELL: 1 +Humanities and Social Sciences +CELL ID: <<<<1>>> +CURRENT: 1 - Humanities and Social Sciences +PARENT: <<>> +Class: https://github.com/tibonto/dfgfo/1 labels: ['Humanities and Social Sciences', 'Geistes- und Sozialwissenschaften'] + +SECTION: 1 Subject Area +INDEX: 1 COL:Subject Area CELL: 11 Humanities +Traceback (most recent call last): + File "/home/acastro/Documents/external_projects/DFG-Fachsystematik-Ontology/scripts/create_ontology.py", line 96, in + cell_id, cell_label = split_id_label(id_n_label=row[tree_hierarchy[index]]) + File "/home/acastro/Documents/external_projects/DFG-Fachsystematik-Ontology/scripts/create_ontology.py", line 34, in split_id_label + id, label = id_n_label.split('\n') +ValueError: not enough values to unpack (expected 2, got 1) +``` + +**Issue:** +Unlike other "Subject Area" values that seperate `NN Subject` with a line break, `11 Humanities` only uses space as a separator: + +**Fix:** +Search & Replace in CSV `11 Humanities` for `"11\nHumanities"` + +**Commit:** \ No newline at end of file