Telegram chat of the scrapy_python group, page 1978


Scrapy

627 members

2020 September 07


Yaswanth Bangaru in Scrapy

https://pastebin.com/M8cSG59T

I am trying to print the JSON response of an XHR POST request. I gave Scrapy's FormRequest everything the browser sends: almost all of the request headers, the form data (essentially unchanged), and the query string parameters. When I run the spider, it throws an error: " 'object, got %s' % type(text).__name__)". When I looked into it, the suggested fix was to set 'ROBOTSTXT_OBEY': False in the custom settings dict in the spider. That is not helping, though. Can someone help me figure out where I am going wrong?

The actual link I am trying to fetch the JSON from is:

https://www.188bet.com/en-gb/sports/football/competition/full-time-asian-handicap-and-over-under?competitionids=26726,26326,27325,26470,26766,72584,72585,72586,72587,28288,27760,27068,27487,29490,27436,38803,29198,27111,32599,72684,26854,26664,29083,30674,27938,43960,27202,29061,99368,26146,28586,26919,29274,28649,29045,26216,26713,27904,26538,46886,28366,26380,29599,27099,26213,26814,26526,28487,26619,29042,27161,41440,26218,26986,27780,26408,26862,103068,28571


# -*- coding: utf-8 -*- import scrapy import json class A188betSpider(scrap - Pastebin.com

23:41 #1

Yaswanth Bangaru in Scrapy

23:42 #2

Yaswanth Bangaru in Scrapy

23:43 #3

Yaswanth Bangaru in Scrapy

Same as the pastebin link, just in case👆

23:43 #4

Andrey Rahmatullin in Scrapy

Passing all headers explicitly is wrong. For example, Content-Length must be calculated by Scrapy.

23:43 #5
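
To illustrate the point about headers, here is a minimal sketch, assuming the headers were copied from the browser's network tab into a dict. The header values, the `COMPUTED_HEADERS` set, and the `strip_computed_headers` helper are hypothetical and not taken from the original spider. The idea is to drop headers such as Content-Length that the HTTP client derives from the request itself, and to pass only the remainder along.

```python
# Headers as they might be copied from the browser (illustrative values only,
# not from the original spider).
browser_headers = {
    "Accept": "application/json, text/plain, */*",
    "Content-Length": "412",
    "Host": "example.com",
    "Connection": "keep-alive",
    "X-Requested-With": "XMLHttpRequest",
}

# Assumption: these headers describe the connection or the encoded body,
# so they are better left for the HTTP client to compute.
COMPUTED_HEADERS = {"content-length", "host", "connection"}


def strip_computed_headers(headers):
    """Return a copy of headers without the client-computed ones."""
    return {
        name: value
        for name, value in headers.items()
        if name.lower() not in COMPUTED_HEADERS
    }


safe_headers = strip_computed_headers(browser_headers)
print(sorted(safe_headers))
```

The filtered dict could then be passed as `headers=safe_headers` to `scrapy.FormRequest`, so that a stale Content-Length from the browser never overrides the length of the body Scrapy actually sends.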


Кирилл in Scrapy


Show the full error

23:45 #6

Yaswanth Bangaru in Scrapy

I changed that, but it's still the same error. The full traceback shows that the second URL it's scraping is a totally different URL from the one I was expecting.

23:46 #7

Yaswanth Bangaru in Scrapy


1 sec

23:46 #8

Yaswanth Bangaru in Scrapy

23:50 #9

Yaswanth Bangaru in Scrapy

23:50 #10

i

Maybe this? https://stackoverflow.com/questions/44068537/getting-error-while-implementing-headers-body-in-scrapy-spider/44082448

getting error while implementing headers, body in Scrapy Spider

When trying to scrap a page passing headers and body i get the following error show below.

i tried converting to json, str and sending it but it doesn't give any results.
please let me know if any...

23:50 #11

Yaswanth Bangaru in Scrapy


Hey, I added that as a custom setting in the crawler itself

23:52 #12

Yaswanth Bangaru in Scrapy

*spider itself

23:52 #13

i

Ah, I see

23:54 #14

i

What is below that last line of the traceback?

23:55 #15

i

Where it says "object got... "

23:55 #16

Yaswanth Bangaru in Scrapy

23:56 #17

Yaswanth Bangaru in Scrapy

My bad, thought I shared everything

23:56 #18

i

Maybe in formdata you should put all those 1, 0, -1, etc. in quotes

23:59 #19

2020 September 08

i

The error says it got an int instead of a str when converting to bytes

00:00 #20
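
The suggestion in the last two messages can be sketched as follows, assuming the form data was written with bare numbers such as 1, 0 and -1. The field names and the `stringify_formdata` helper are hypothetical, since the original spider's form data is not shown in the chat. The sketch converts every key and value to `str` before the dict is handed to the request, so nothing of type int reaches the bytes conversion.

```python
def stringify_formdata(formdata):
    """Return a copy of formdata with all keys and values converted to str.

    Lists and tuples are converted item by item so that repeated form
    fields keep their multiple values.
    """
    clean = {}
    for key, value in formdata.items():
        if isinstance(value, (list, tuple)):
            clean[str(key)] = [str(item) for item in value]
        else:
            clean[str(key)] = str(value)
    return clean


# Hypothetical form fields; the integer values mimic the ones mentioned in the chat.
raw_formdata = {
    "pageSize": 50,
    "sortBy": -1,
    "live": 0,
    "competitionIds": "26726,26326",
}

clean_formdata = stringify_formdata(raw_formdata)
print(clean_formdata)
```

The result could be passed as `formdata=clean_formdata` to `scrapy.FormRequest`. Note that `str()` turns `None` into "None" and `True` into "True", so values like these may need explicit handling depending on what the server expects.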