Skip to content
PCT Top applicants 2022
PCT TOP APPLICANTS 2022
Source: WIPO statistics Feb 2023 Note: Counts are based on publication date, corporate applicants only (excluding natural person) and first-named applicant.
# Importing the pandas library and assigning it the alias 'pd'
import pandas as pd1.IMPORT AND CLEAN XLSX
# Skiprows=6 - skip first 6 rows
# usecols B=>D to extract 2022 stats
data = pd.read_excel('y_top_pct_applicants.xlsx', skiprows=6,usecols="B:D")2. EDA
data.head(3)data.tail()DataFrameas
df
variable
SELECT
"ORIGIN CODE",
COUNT(*) AS "NUMBER"
FROM data
WHERE "ORIGIN CODE" = 'CN' OR "ORIGIN CODE" = 'FR'
GROUP BY "ORIGIN CODE"# Range index
# No Null
# datatype ok
data.info()data.describe()data.count()The DataFrame contains 2941 entries. There are three columns: APPLICANT'S NAME: The name of the applicant (Object data type). ORIGIN CODE: A code representing the origin of the application (Object data type). 2022 PCT APPLICATIONS: The number of PCT applications in 2022 (Integer data type). All the columns have non-null values.
data.to_excel('pct_stats_cleaned.xlsx', index=False)2. ANALYSIS