Datasources =========== The European Union Parliament uses https://www.europarl.europa.eu/plenary/en/home.html to make documents available to members of the parliament and the public. It provides the UI and features like the search functionality. The documents themselves are hosted on https://www.europarl.europa.eu/doceo/document/ and are accessible via the links from the main site or enumeration. Enumerating these documents is aided by a common naming scheme. +----------------+---------------------------+----------------------------------------------------------------------------+--------------------------------------------------------------------------------+---------------------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------------------------------------+ | Type | Subtype | Description | URL | URL Structure | Filetypes | Languages | +----------------+---------------------------+----------------------------------------------------------------------------+--------------------------------------------------------------------------------+---------------------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------------------------------------+ | Agenda | Session Agenda | Agenda overview of a complete session (multiple days) | https://www.europarl.europa.eu/doceo/document/OJ-9-2020-11-23-SYN_DE.html | /OJ-{period}-{date of first session day}-SYN-{language}.{filetype} | .html, .pdf | BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV | | +---------------------------+----------------------------------------------------------------------------+--------------------------------------------------------------------------------+---------------------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------------------------------------+ | | Daily Agenda | Detailed daily agenda including agenda items, speaking times and deadlines | https://www.europarl.europa.eu/doceo/document/OJQ-9-2020-11-23_DE.html | /OJQ-{period}-{date}-{language}.{filetype} | .html, .pdf | BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV | +----------------+---------------------------+----------------------------------------------------------------------------+--------------------------------------------------------------------------------+---------------------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------------------------------------+ | Voting Results | Summarized Voting Results | Summary of the voting results, referencing the votes document | https://www.europarl.europa.eu/doceo/document/PV-9-2020-10-06-VOT_DE.pdf | /PV-{period}-{date}-VOT_{langauge}.{filetype} | .html, .xml, .docx, .pdf | BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV | | +---------------------------+----------------------------------------------------------------------------+--------------------------------------------------------------------------------+---------------------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------------------------------------+ | | Voting protocol | Named votes per voiting | https://www.europarl.europa.eu/doceo/document/PV-9-2020-10-06-RCV_DE.pdf | /PV-{period}-{date}-VOT_{language}.{filetype} | .html, .docx | BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV | +----------------+---------------------------+----------------------------------------------------------------------------+--------------------------------------------------------------------------------+---------------------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------------------------------------+ | Word Protocol | Table of contents | Table of contents of the verbatim record | https://www.europarl.europa.eu/doceo/document/CRE-9-2020-07-09-TOC_DE.html | /CRE-{period}-{date}-TOC_{language}.{filetype} | .html | BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV | | +---------------------------+----------------------------------------------------------------------------+--------------------------------------------------------------------------------+---------------------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------------------------------------+ | | Section | Verbatim record of an agenda section | https://www.europarl.europa.eu/doceo/document/CRE-9-2020-07-09-ITM-001_DE.html | /CRE-{period}-{date}-ITM-{running number, 3 digits}_{language}.{filetype} | .html, .xml | BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV | | +---------------------------+----------------------------------------------------------------------------+--------------------------------------------------------------------------------+---------------------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------------------------------------+ | | Complete protocol | Verbatim record of the complete session | https://www.europarl.europa.eu/doceo/document/CRE-9-2020-07-09_DE.html | /CRE-{period}-{date}_{language}.{filetype} | .html, .xml, .pdf | BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV | | | | | | | | The protocol isn't translated but contains only the verbatim records in the language of the speaker | +----------------+---------------------------+----------------------------------------------------------------------------+--------------------------------------------------------------------------------+---------------------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------------------------------------+ | Protocol | Attendance list | Attendance list of all representative | https://www.europarl.europa.eu/doceo/document/PV-9-2020-10-19-ATT_DE.html | /PV-{period}-{date}-ATT_{language}.{filetype} | .html, .xml, .docx, .pdf | BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV | | +---------------------------+----------------------------------------------------------------------------+--------------------------------------------------------------------------------+---------------------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------------------------------------+ | | Table of contents | Table of contents of the complete protocol | https://www.europarl.europa.eu/doceo/document/PV-9-2020-10-19-TOC_DE.html | /PV-{period}-{date}-TOC_{language}.{filetype} | .html | BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV | | +---------------------------+----------------------------------------------------------------------------+--------------------------------------------------------------------------------+---------------------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------------------------------------+ | | Complete Protocol | Complete protocol | https://www.europarl.europa.eu/doceo/document/PV-9-2020-10-19_DE.html | /PV-{period}-{date}_{language}.{filetype} | .html, .xml, .docx, .pdf | BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV | | +---------------------------+----------------------------------------------------------------------------+--------------------------------------------------------------------------------+---------------------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------------------------------------+ | | Section | Section of the complete protocol | https://www.europarl.europa.eu/doceo/document/PV-9-2020-10-19-ITM-006_DE.html | /PV-{period}-{date}-ITM-{running number, 3 digits}_{language}.{filetype} | .html, .xml | BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV | +----------------+---------------------------+----------------------------------------------------------------------------+--------------------------------------------------------------------------------+---------------------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------------------------------------+ | '''Texts''' | Tabled Texts | Text entered for discussion and voting upon | https://www.europarl.europa.eu/doceo/document/A-9-2020-0181_EN.html | /A-{period}-{year}-{running number, 4 digits}_{language}.{filetype} | .html, .pdf, .word | BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV | | +---------------------------+----------------------------------------------------------------------------+--------------------------------------------------------------------------------+---------------------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------------------------------------+ | | Accepted Texts TOC | Overview over texts accepted in a session | https://www.europarl.europa.eu/doceo/document/TA-9-2020-10-20-TOC_DE.html | /TA-{period}-{date}-TOC_{language}.{filetype} | .html | BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV | | +---------------------------+----------------------------------------------------------------------------+--------------------------------------------------------------------------------+---------------------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------------------------------------+ | | Accepted Texts | Accepted texts in their final form | https://www.europarl.europa.eu/doceo/document/TA-9-2020-0272_DE.html | /TA-{period}-{year}-{running number, 4 digits}-{language}.{filetype} | .html, .pdf, .word | BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV | | +---------------------------+----------------------------------------------------------------------------+--------------------------------------------------------------------------------+---------------------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------------------------------------+ | | All accepted texts | All accepted Texts | https://www.europarl.europa.eu/doceo/document/TA-9-2020-10-20_DE.html | /TA-{period}-{date}_{language}.{filetype} | .html | BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV | +----------------+---------------------------+----------------------------------------------------------------------------+--------------------------------------------------------------------------------+---------------------------------------------------------------------------+--------------------------+-----------------------------------------------------------------------------------------------------+ All URLs to these documents can be derived directly from the session date. The exception to this is the text document type. Their reliance on a running number makes special handling necessary. Not all documents are available over the complete time. These plots show the contained content of the downloaded files. .. image:: ./images/Word_protocol_file_content_over_time.png Word protocol size over time, depending on file type. .. image:: ./images/Protocol_file_content_over_time.png Protocol size over time, depending on file type.