illuminati_13's blog

By illuminati_13, history, 4 months ago, In English
import requests
from bs4 import BeautifulSoup

url = "https://codeforces.net/problemset/page/11?tags=binary+search"
response = requests.get(url)
soup = BeautifulSoup(response.text, 'html.parser')

# Note: problemset listing pages do not contain 'problem-statement' divs
# (those only appear on individual problem pages), so this finds nothing.
questions = soup.find_all('div', class_='problem-statement')
question_texts = [q.find('div', class_='title').text for q in questions]

total_questions = len(question_texts)
first_five_words = [q.split()[:5] for q in question_texts[:5]]
last_five_words = [q.split()[:5] for q in question_texts[-5:]]

print("Total questions:", total_questions)
print("First 5 words of the first question:", first_five_words)
print("First 5 words of the last 5 questions:", last_five_words)

I know this code isn't working because it relies on web scraping. Can anyone tell me how to use the Codeforces API for the same task, i.e. to fetch all the problems under the binary search tag and store them as a Python list? I am doing this for a project where I have to train a Transformer model on programming problem statements.
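
For the API route: Codeforces provides a problemset.problems method (documented at https://codeforces.com/apiHelp/methods) that accepts a semicolon-separated tags parameter and returns every matching problem in a single JSON response, so no page-by-page scraping is needed. Below is a minimal sketch of that approach; note that the API returns problem names, ratings, and tags, but not the statement text itself, so full statements would still have to be fetched separately.

import requests

# problemset.problems returns every problem matching the given tags in one response.
API_URL = "https://codeforces.com/api/problemset.problems"

response = requests.get(API_URL, params={"tags": "binary search"})
response.raise_for_status()
data = response.json()

if data["status"] != "OK":
    raise RuntimeError(f"Codeforces API error: {data.get('comment')}")

# Each problem dict carries contestId, index, name, and tags (rating when available).
problems = data["result"]["problems"]
problem_names = [p["name"] for p in problems]

print("Total problems:", len(problem_names))
print("First 5 problems:", problem_names[:5])
print("Last 5 problems:", problem_names[-5:])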



»
4 months ago, # |
Rev. 4

Here is the corrected and working code, without the Codeforces API, that I wrote. Hope it helps.

import requests
from bs4 import BeautifulSoup

def getproblems(url):
    problems = []
    response = requests.get(url)
    soup = BeautifulSoup(response.content, 'html.parser')
    # Every problem row on the listing page links to /problemset/problem/<contest>/<index>.
    problem_links = soup.find_all('a', href=lambda href: href and "/problemset/problem" in href)
    for link in problem_links:
        name = link.get_text().strip()
        original_link = link['href']
        # Extracting the contest number and problem index from the original link
        parts = original_link.split("/")
        number = parts[-2]
        problem_code = parts[-1]
        # Constructing the full status link with the correct format (not used further here)
        full_link = f"https://codeforces.net/problemset/status/{number}/problem/{problem_code}"
        # Each problem appears twice in the table: once with the id as link text and once with the title.
        # Keep only the titles.
        if name != (number + problem_code):
            problems.append(name)
    return problems

url = "https://codeforces.net/problemset/page/11?tags=binary+search"
questions = getproblems(url)

total_questions = len(questions)
first_five_words = [q.split()[:5] for q in questions[:5]]
last_five_words = [q.split()[:5] for q in questions[-5:]]

print("Total questions:", total_questions)
print("First 5 words of the first 5 questions:", first_five_words)
print("First 5 words of the last 5 questions:", last_five_words)
print(questions)
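
The code above only covers a single page of the tag listing. A hypothetical extension for collecting every page is sketched below; it assumes (based on how the problemset pages usually behave, not something verified here) that out-of-range page numbers serve the last page again, so the loop simply stops as soon as a page adds nothing new.

all_problems = []
page = 1
while True:
    page_url = f"https://codeforces.net/problemset/page/{page}?tags=binary+search"
    page_problems = getproblems(page_url)
    # Keep only problems we have not seen yet; stop when a page adds nothing new.
    new_problems = [p for p in page_problems if p not in all_problems]
    if not new_problems:
        break
    all_problems.extend(new_problems)
    page += 1

print("Total problems across all pages:", len(all_problems))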