Datasources

The European Parliament uses https://www.europarl.europa.eu/plenary/en/home.html to make documents available to its members and the public. This site provides the user interface and features such as search. The documents themselves are hosted under https://www.europarl.europa.eu/doceo/document/ and can be reached either through links on the main site or by enumeration, which is aided by a common naming scheme.

Type

Subtype

Description

URL

URL Structure

Filetypes

Languages

Agenda

Session Agenda

Agenda overview of a complete session (multiple days)

https://www.europarl.europa.eu/doceo/document/OJ-9-2020-11-23-SYN_DE.html

/OJ-{period}-{date of first session day}-SYN_{language}.{filetype}

.html, .pdf

BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV

Daily Agenda

Detailed daily agenda including agenda items, speaking times and deadlines

https://www.europarl.europa.eu/doceo/document/OJQ-9-2020-11-23_DE.html

/OJQ-{period}-{date}_{language}.{filetype}

.html, .pdf

BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV

Voting Results

Summarized Voting Results

Summary of the voting results, referencing the votes document

https://www.europarl.europa.eu/doceo/document/PV-9-2020-10-06-VOT_DE.pdf

/PV-{period}-{date}-VOT_{language}.{filetype}

.html, .xml, .docx, .pdf

BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV

Voting protocol

Roll-call votes (votes by name) for each vote

https://www.europarl.europa.eu/doceo/document/PV-9-2020-10-06-RCV_DE.pdf

/PV-{period}-{date}-RCV_{language}.{filetype}

.html, .docx

BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV

Word Protocol

Table of contents

Table of contents of the verbatim record

https://www.europarl.europa.eu/doceo/document/CRE-9-2020-07-09-TOC_DE.html

/CRE-{period}-{date}-TOC_{language}.{filetype}

.html

BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV

Section

Verbatim record of an agenda section

https://www.europarl.europa.eu/doceo/document/CRE-9-2020-07-09-ITM-001_DE.html

/CRE-{period}-{date}-ITM-{running number, 3 digits}_{language}.{filetype}

.html, .xml

BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV

Complete protocol

Verbatim record of the complete session

https://www.europarl.europa.eu/doceo/document/CRE-9-2020-07-09_DE.html

/CRE-{period}-{date}_{language}.{filetype}

.html, .xml, .pdf

BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV. Note: the protocol is not translated; each contribution appears only in the language of the speaker.

Protocol

Attendance list

Attendance list of all representatives

https://www.europarl.europa.eu/doceo/document/PV-9-2020-10-19-ATT_DE.html

/PV-{period}-{date}-ATT_{language}.{filetype}

.html, .xml, .docx, .pdf

BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV

Table of contents

Table of contents of the complete protocol

https://www.europarl.europa.eu/doceo/document/PV-9-2020-10-19-TOC_DE.html

/PV-{period}-{date}-TOC_{language}.{filetype}

.html

BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV

Complete Protocol

Complete protocol

https://www.europarl.europa.eu/doceo/document/PV-9-2020-10-19_DE.html

/PV-{period}-{date}_{language}.{filetype}

.html, .xml, .docx, .pdf

BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV

Section

Section of the complete protocol

https://www.europarl.europa.eu/doceo/document/PV-9-2020-10-19-ITM-006_DE.html

/PV-{period}-{date}-ITM-{running number, 3 digits}_{language}.{filetype}

.html, .xml

BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV

Texts

Tabled Texts

Text tabled for discussion and a vote

https://www.europarl.europa.eu/doceo/document/A-9-2020-0181_EN.html

/A-{period}-{year}-{running number, 4 digits}_{language}.{filetype}

.html, .pdf, .word

BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV

Accepted Texts TOC

Overview of the texts accepted in a session

https://www.europarl.europa.eu/doceo/document/TA-9-2020-10-20-TOC_DE.html

/TA-{period}-{date}-TOC_{language}.{filetype}

.html

BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV

Accepted Texts

Accepted texts in their final form

https://www.europarl.europa.eu/doceo/document/TA-9-2020-0272_DE.html

/TA-{period}-{year}-{running number, 4 digits}_{language}.{filetype}

.html, .pdf, .word

BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV

All accepted texts

All texts accepted in a session

https://www.europarl.europa.eu/doceo/document/TA-9-2020-10-20_DE.html

/TA-{period}-{date}_{language}.{filetype}

.html

BG, ES, CS, DA, DE, ET, EL, EN, FR, HR, IT, LV, LT, HU, MT, NL, PL, PT, RO, SK, SL, FI, SV
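
The date-based URL structures listed above can be assembled mechanically. The following is a minimal sketch in Python, not an official client: the function name `build_document_url` and its parameters are assumptions made for illustration. Only the base URL and the naming pattern, as seen in the example URLs, come from the table.

```python
from datetime import date
from typing import Optional

BASE_URL = "https://www.europarl.europa.eu/doceo/document/"


def build_document_url(prefix: str, period: int, day: date, language: str,
                       filetype: str, suffix: Optional[str] = None) -> str:
    """Assemble /{prefix}-{period}-{date}[-{suffix}]_{language}.{filetype}.

    Hypothetical helper: prefix is e.g. "OJ", "PV", "CRE" or "TA";
    suffix is e.g. "SYN", "VOT", "RCV", "ATT", "TOC" or "ITM-001".
    """
    # date.isoformat() yields YYYY-MM-DD, the date format used in the example URLs.
    parts = [prefix, str(period), day.isoformat()]
    if suffix:
        parts.append(suffix)
    # Accept the file type with or without a leading dot.
    return f"{BASE_URL}{'-'.join(parts)}_{language.upper()}.{filetype.lstrip('.')}"


print(build_document_url("PV", 9, date(2020, 10, 6), "DE", "pdf", suffix="VOT"))
# prints https://www.europarl.europa.eu/doceo/document/PV-9-2020-10-06-VOT_DE.pdf
```

Passing the suffix separately keeps one function usable for all date-based rows of the table. Whether a given combination of date, language and file type actually exists on the server still has to be checked by requesting it.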

The URLs of these documents can be derived directly from the session date, with one exception: tabled and accepted texts are identified by a running number instead of a date and therefore need special handling.
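
One possible form of this special handling is to generate candidate URLs over a range of running numbers and then check each candidate for existence. The sketch below only generates the candidates. The helper name `text_url_candidates` and the default bounds are assumptions, and this document does not state whether the numbering is free of gaps.

```python
from typing import Iterator

BASE_URL = "https://www.europarl.europa.eu/doceo/document/"


def text_url_candidates(prefix: str, period: int, year: int, language: str,
                        filetype: str, first: int = 1,
                        last: int = 9999) -> Iterator[str]:
    """Yield candidates /{prefix}-{period}-{year}-{NNNN}_{language}.{filetype}.

    Hypothetical helper: prefix is "A" (tabled texts) or "TA" (accepted texts).
    The running number is zero-padded to 4 digits, as in the example URLs.
    """
    for number in range(first, last + 1):
        yield f"{BASE_URL}{prefix}-{period}-{year}-{number:04d}_{language}.{filetype}"


for url in text_url_candidates("A", 9, 2020, "EN", "html", first=180, last=182):
    print(url)
```

A generator keeps memory use flat even for the full 4-digit range. A downloader built on top of it would likely want rate limiting and a stopping rule, neither of which is shown here.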

Not all documents are available for the entire period. The following plots show how much content the downloaded files contain over time, by file type.

../_images/Word_protocol_file_content_over_time.png

Word protocol size over time, depending on file type.

../_images/Protocol_file_content_over_time.png

Protocol size over time, depending on file type.
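
Gaps in availability can be recorded while downloading by noting which candidate URLs actually return content. The following is a minimal sketch under stated assumptions: `fetch` is a hypothetical caller-supplied function (for example a thin wrapper around an HTTP client) that returns the response body as bytes, or `None` when the document does not exist. Byte size is used as a stand-in measure, since the exact measure behind the plots is not specified here.

```python
from typing import Callable, Dict, Iterable, Optional


def content_sizes(urls: Iterable[str],
                  fetch: Callable[[str], Optional[bytes]]) -> Dict[str, int]:
    """Map each available URL to the size of its body in bytes.

    URLs for which fetch returns None are treated as missing and omitted.
    """
    sizes: Dict[str, int] = {}
    for url in urls:
        body = fetch(url)
        if body is not None:
            sizes[url] = len(body)
    return sizes


# Stand-in for a real downloader, so the sketch runs without network access.
fake_store = {"https://example.invalid/CRE-9-2020-07-09_DE.html": b"<html></html>"}
print(content_sizes(
    ["https://example.invalid/CRE-9-2020-07-09_DE.html",
     "https://example.invalid/CRE-9-2020-07-09_DE.pdf"],
    fake_store.get,
))
```

Injecting `fetch` keeps the bookkeeping separate from the network code, so the same function can be exercised offline.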