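Visualizing fine dust with selenium and a shapefile (using folium)

Converting the coordinate system and reloading the shapefile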

import shapefile
from json import dumps
import geopandas as gpd

# Reproject the shapefile to WGS84 (EPSG:4326), the latitude/longitude system folium expects
sf = gpd.read_file('/home/ducj/data/ducj/files/json/shp/CTPRVN_201703/TL_SCCO_CTPRVN.shp', encoding='cp949')
sf = sf.to_crs(epsg=4326)
sf.to_file('/home/ducj/data/ducj/files/json/shp/CTPRVN_201703/test.shp', encoding='cp949')

# Reload the reprojected shapefile with pyshp
reader = shapefile.Reader('/home/ducj/data/ducj/files/json/shp/CTPRVN_201703/test.shp', encoding='cp949')

# Convert the shapefile records into a list of GeoJSON features
fields = reader.fields[1:]  # skip the leading DeletionFlag field
field_names = [field[0] for field in fields]
buffer = []
for sr in reader.shapeRecords():
    atr = dict(zip(field_names, sr.record))
    geom = sr.shape.__geo_interface__
    name = sr.record[2]  # third attribute, used as the feature id
    buffer.append(dict(type="Feature", properties=atr, id=name, geometry=geom))

# Save the features as a GeoJSON file
with open("/home/ducj/data/ducj/files/json/shp/CTPRVN_201703/pyshp-demo.json", "w") as geojson:
    geojson.write(dumps({"type": "FeatureCollection", "features": buffer}, indent=2) + "\n")
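
The conversion above can also be wrapped in a small function that works on plain Python data, which makes it easy to check without a shapefile on disk. This is only a sketch: the function name `to_feature_collection`, the `id_index` parameter, and the sample record (including the `CTPRVN_CD` and `CTP_ENG_NM` field names and the point geometry) are assumptions made for illustration, not part of the original notebook.

```python
import json


def to_feature_collection(field_names, records, geometries, id_index=2):
    """Build a GeoJSON FeatureCollection dict from parallel records and geometries."""
    features = []
    for record, geom in zip(records, geometries):
        features.append({
            "type": "Feature",
            "id": record[id_index],                       # same choice as sr.record[2] above
            "properties": dict(zip(field_names, record)),
            "geometry": geom,
        })
    return {"type": "FeatureCollection", "features": features}


# Hypothetical sample data standing in for reader.shapeRecords()
field_names = ["CTPRVN_CD", "CTP_ENG_NM", "CTP_KOR_NM"]
records = [["11", "Seoul", "서울특별시"]]
geometries = [{"type": "Point", "coordinates": [126.982, 37.5502]}]

collection = to_feature_collection(field_names, records, geometries)
print(json.dumps(collection, ensure_ascii=False, indent=2))
```

With real data, `records` would be the `sr.record` values and `geometries` the `sr.shape.__geo_interface__` dicts collected in the loop.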

Code for crawling the fine dust data

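from io import StringIO
from time import sleep

import pandas as pd
from selenium import webdriver
from selenium.webdriver.chrome.options import Options
from selenium.webdriver.chrome.service import Service

# Start a headless Chrome session
options = Options()
options.add_argument('--headless')
options.add_argument('--no-sandbox')
# Selenium 4 takes the driver path through Service; the older chrome_options=
# and executable_path= arguments were removed in recent Selenium 4 releases.
driver = webdriver.Chrome(service=Service("/home/ducj/crawling/chromedriver"), options=options)
driver.implicitly_wait(3)
driver.maximize_window()
url = 'https://www.airkorea.or.kr/web/sidoQualityCompare?itemCode=10007&pMENU_NO=101'
driver.get(url)
sleep(0.3)  # give the page a moment to render the table
html = driver.page_source
# Parse every <table> in the page into a list of DataFrames
dfs = pd.read_html(StringIO(html))
driver.quit()  # end the session and the chromedriver process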

dfs[0]

Reshaping the crawled data into the required form

# Build the data: take the first row (hourly mean) without its label column
row = dfs[0].iloc[0, 1:]
data = pd.DataFrame({'area': row.index, 'dust': row.values})
# Convert the values to numbers
data['dust'] = data['dust'].astype(float)
data = data.set_index('area')
print(data.dtypes)
data.head()

dust    float64
dtype: object
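
To make the reshaping step concrete, here is a plain-Python sketch of the same idea: drop the label column, pair each region name with its value from the first row, and convert the value to a float. The function name `row_to_dust` is hypothetical, and the sample lists simply copy the first few columns of the crawled table shown in this post.

```python
def row_to_dust(header, row):
    """Pair each region in the header with its value, skipping the label column."""
    return {area: float(value) for area, value in zip(header[1:], row[1:])}


# First few columns of the crawled table (header row and the hourly-mean row)
header = ["Unnamed: 0", "서울", "부산", "대구", "인천", "광주"]
first_row = ["시간평균", 61, 41, 42, 55, 31]

dust = row_to_dust(header, first_row)
print(dust)
# {'서울': 61.0, '부산': 41.0, '대구': 42.0, '인천': 55.0, '광주': 31.0}
```

The pandas version does the same thing, with the region names ending up as the index of `data`.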

Loading the JSON file created earlier and matching the region names

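import folium
import json

# Load the GeoJSON file produced earlier; a separate name keeps the json module usable
with open("/home/ducj/data/ducj/files/json/shp/CTPRVN_201703/pyshp-demo.json", encoding='utf-8') as f:
    geo = json.load(f)

# Overwrite each feature id with the short region name used in the crawled table.
# Features and regions are paired by position, so this is only correct when
# both are listed in the same order.
for i in range(17):
    geo['features'][i]['id'] = data['dust'].index[i]

# for i in range(17):
#     geo['features'][i]['properties']['CTP_KOR_NM'] = data['dust'].index[i]

Visualizing as a map

# Recent folium releases may no longer bundle the Stamen tiles;
# a built-in set such as 'cartodbpositron' can be used instead.
m = folium.Map(location=[37.5502, 126.982], tiles='Stamen toner')  # Seoul
folium.Choropleth(
    geo_data=geo,
    name='choropleth',
    data=data['dust'],
    key_on='feature.id',
    fill_color='YlGn',
    fill_opacity=0.7,
    line_opacity=0.2).add_to(m)

folium.LayerControl().add_to(m)

m.save('/home/ducj/nas/data.html')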
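
The loop that overwrites the feature ids pairs shapefile features with table columns by position, which silently mislabels regions if the two orders differ. A safer variant looks up the short name from each feature's own `CTP_KOR_NM` property. The sketch below assumes the shapefile stores the full administrative names listed in `FULL_TO_SHORT`; both that mapping and the function name `relabel_features` are assumptions to verify against the actual data.

```python
# Assumed full-name -> short-name pairs; check them against the shapefile's CTP_KOR_NM values
FULL_TO_SHORT = {
    "서울특별시": "서울", "부산광역시": "부산", "대구광역시": "대구",
    "인천광역시": "인천", "광주광역시": "광주", "대전광역시": "대전",
    "울산광역시": "울산", "세종특별자치시": "세종", "경기도": "경기",
    "강원도": "강원", "충청북도": "충북", "충청남도": "충남",
    "전라북도": "전북", "전라남도": "전남", "경상북도": "경북",
    "경상남도": "경남", "제주특별자치도": "제주",
}


def relabel_features(geojson, name_field="CTP_KOR_NM", mapping=FULL_TO_SHORT):
    """Set each feature's id to the short region name looked up from its properties."""
    for feature in geojson["features"]:
        full_name = feature["properties"][name_field]
        feature["id"] = mapping[full_name]  # a KeyError exposes any unmapped name
    return geojson


# Tiny stand-in for the loaded GeoJSON, deliberately not in table order
sample = {"type": "FeatureCollection", "features": [
    {"type": "Feature", "id": "세종특별자치시",
     "properties": {"CTP_KOR_NM": "세종특별자치시"}, "geometry": None},
    {"type": "Feature", "id": "경기도",
     "properties": {"CTP_KOR_NM": "경기도"}, "geometry": None},
]}

relabel_features(sample)
print([f["id"] for f in sample["features"]])
# ['세종', '경기']
```

Because the lookup is by name, the result does not depend on the order of the features or of the table columns.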

Python 3.7 Minesweeper

def mine():
    import random
    from IPython.display import clear_output

    # Place the mines: each cell is a mine with probability 0.3
    board = [[random.random() < 0.3 for x in range(10)] for y in range(10)]
    # Count the mines
    mine_count = sum(cell for row in board for cell in row)
    # board2 tracks the player's picks: ' ' = not picked yet, False = picked
    board2 = [[' ' for x in range(10)] for y in range(10)]

    while True:
        # Read the coordinates from the user (1-based)
        x = int(input('Enter the x coordinate: ')) - 1
        y = int(input('Enter the y coordinate: ')) - 1

        # Reject coordinates that are off the board or were already picked
        if not (0 <= x < 10 and 0 <= y < 10) or board2[x][y] == False:
            print('Invalid position.')
            continue

        # If the picked cell is a mine, print the mine positions and stop
        if board[x][y]:
            print('You hit a mine.')
            for r in range(10):
                for c in range(10):
                    print('# ' if board[r][c] else '. ', end='')
                print()
            break

        # Otherwise mark the cell and print the positions picked so far
        board2[x][y] = False
        clear_output()
        remaining = 0
        for r in range(10):
            for c in range(10):
                if board2[r][c]:
                    print('. ', end='')
                    remaining += 1
                else:
                    print('x ', end='')
            print()
        print('Mines:', mine_count, 'Cells remaining:', remaining)
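
The board setup in the game can be pulled out into helpers so that it is testable without keyboard input. This is a sketch under assumptions: the names `make_board` and `count_mines`, and the optional `rng` parameter used to fix the random seed, are illustrative and do not appear in the original code.

```python
import random


def make_board(size=10, mine_probability=0.3, rng=None):
    """Return a size x size grid of booleans; True marks a mine."""
    rng = rng or random.Random()
    return [[rng.random() < mine_probability for _ in range(size)]
            for _ in range(size)]


def count_mines(board):
    """Count the cells that hold a mine."""
    return sum(cell for row in board for cell in row)


# A fixed seed gives a repeatable layout, which helps when debugging
board = make_board(rng=random.Random(0))
print('Mines:', count_mines(board))
```

Passing a seeded `random.Random` keeps the layout reproducible, while omitting `rng` gives a fresh random board on every call, as in the game.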


Output of dfs[0] (rows: hourly mean, daily mean, maximum, minimum):

	Unnamed: 0	서울	부산	대구	인천	광주	대전	울산	경기	강원	충북	충남	전북	전남	세종	경북	경남	제주
0	시간평균	61	41	42	55	31	40	42	66	78	47	49	32	25	51	53	35	27
1	일평균	64	40	43	55	36	42	41	68	79	49	49	34	26	48	51	36	29
2	최고값	87	96	64	73	75	56	70	99	146	73	78	64	50	61	96	66	38
3	최저값	45	27	22	39	27	26	30	42	47	23	19	1	15	40	22	21	18

Output of data.head():

area	dust
서울	61.0
부산	41.0
대구	42.0
인천	55.0
광주	31.0

