Dell, Linux, Ubuntu, TouchScreen, Off

2022. 1. 28. 17:23

jake_kim

1) Windows

# 참조 : https://www.dell.com/support/kbdoc/ko-kr/000123813/a-%ED%84%B0%EC%B9%98-%EC%8A%A4%ED%81%AC%EB%A6%B0-a-dell-%EC%BB%B4%ED%93%A8%ED%84%B0#touchscreen_calibration

키보드에서 Windows Key + X를 누른 뒤 장치 관리자(Device Manager)를 클릭합니다.
휴먼 인터페이스 장치(Human Interface Devices) 필드를 확장합니다.
HID 규격 터치 스크린을 마우스 오른쪽 버튼으로 클릭 합니다.
사용 안 함을 클릭합니다.

2) Linux ( Ubuntu )

# 참조 : https://www.dell.com/support/kbdoc/ko-kr/000129814/dell-pc%EC%9D%98-ubuntu-16-04%EC%97%90%EC%84%9C-%ED%84%B0%EC%B9%98%ED%8C%A8%EB%93%9C-%EB%B0%8F-%EA%B8%B0%ED%83%80-%EC%9E%85%EB%A0%A5-%EC%9E%A5%EC%B9%98-%EB%AC%B8%EC%A0%9C-%ED%95%B4%EA%B2%B0

#

> xinput list

# find id(number) of "ELAN Touchscreen" at list .

> xinput --disable [id]

ISP, ISMP, RFP, FP (0)	2022.03.04
전자정부표준프레임워크 (eGovFrame) (0)	2022.03.04
MAU, MCU, DAU, ACU (0)	2022.01.21
Fluent API, 빌더 패턴 (0)	2021.11.15
Template Meta Programming (0)	2021.11.14

비정형 데이터 분석.참고자료

2022. 1. 26. 17:05

jake_kim

보호되어 있는 글입니다.
내용을 보시려면 비밀번호를 입력하세요.

비정형 데이터 분석

2022. 1. 26. 16:34

jake_kim

보호되어 있는 글입니다.
내용을 보시려면 비밀번호를 입력하세요.

Github Action 사용법

2022. 1. 25. 13:07

jake_kim

Git, Github

# 참조 : https://zzsza.github.io/development/2020/06/06/github-action/

'Git, Github' 카테고리의 다른 글

(작성중) Git, Github (0)	2021.12.02

,

KoNLPy, TF-IDF Vectorizor

2022. 1. 24. 17:41

jake_kim

형태소분석,Lucene,mecap,komoran,tf-idf vector

# 참조 : https://konlpy.org/ko/v0.5.2/install/

# 참조 : http://doc.mindscale.kr/km/unstructured/03.html

# 참조 : https://dianakang.tistory.com/48

형태소 분석기는 형태소를 분석해주는 프로그램이다. konlpy는 Python에서 사용할 수 있는 오픈소스 형태소 분석기로, 기존에 공개된 꼬꼬마(Kkma), 코모란(Komoran), 한나눔(Hannanum), 트위터(Twitter), 메카브(Mecab)를 한 번에 설치하고 동일한 방법으로 쓸 수 있게 해준다. (단 메카브는 윈도에서 사용할 수 없다)

sudo apt-get install g++ openjdk-8-jdk python3-dev python3-pip curl

python3 -m pip install --upgrade pip

python3 -m pip install konlpy

sudo apt-get install curl git

bash <(curl -s https://raw.githubusercontent.com/konlpy/konlpy/master/scripts/mecab.sh)

# 도커 기반일 경우
> FROM python:3

> ENV JAVA_HOME /usr/lib/jvm/java-1.7-openjdk/jre
> RUN apt-get update && apt-get install -y g++ default-jdk
> RUN pip install konlpy

> # Write left part as you want

# 다운로드 
https://archive.ics.uci.edu/ml/datasets/Sentiment+Labelled+Sentences

> vi amazonpkl.py
import pandas as pd

df = pd.read_csv('amazon_cells_labelled.txt', sep="\t", header=None)

content = df[0]
sentiment = df[1]

from sklearn.feature_extraction.text import TfidfVectorizer
tfidf = TfidfVectorizer(stop_words='english',
                        lowercase=True,
                        max_features=1000)
tdm = tfidf.fit_transform(content)
print(tfidf.get_feature_names()[-10:])

import joblib
with open('amazon.pkl', 'wb') as f:
    joblib.dump(
        {'vectorizer': tfidf, 'tdm': tdm, 'sentiment': sentiment},
        f
    )





> python amazonpkl.py

> vi amazonpklinfo.py
import joblib

with open('amazon.pkl', 'rb') as f:
    data = joblib.load(f)

locals().update(data)

count = tdm.sum(axis=0)

import pandas as pd
word_count = pd.DataFrame({
    '단어': vectorizer.get_feature_names(),
    '빈도': count.flat})

print(word_count.tail())




> python amazonpklinfo.py

'형태소분석,Lucene,mecap,komoran,tf-idf vector' 카테고리의 다른 글

KOMORAN 설치 (0)	2022.01.24
Mecab Kor 사용법 (0)	2022.01.24
기본 설치 및 테스트 (0)	2019.04.25

,

KOMORAN 설치

2022. 1. 24. 17:37

jake_kim

형태소분석,Lucene,mecap,komoran,tf-idf vector

# KoNLPy 를 설치하면 KOMORAN 까지 한방에 다 해결되니 아래 링크로 보자.

https://storycode.tistory.com/466

# 참조 : https://www.shineware.co.kr/products/komoran/#demo?utm_source=komoran-kr&utm_medium=Referral&utm_campaign=github-demo

# https://github.com/shineware/PyKOMORAN
pip install PyKomoran

# 혹은

git clone https://github.com/shineware/PyKOMORAN
cd PyKOMORAN/python
sudo python setup.py install

sudo apt install openjdk-11-jre-headless

> vi test.py

from PyKomoran import *

komoran = Komoran("EXP")
print(komoran.get_plain_text("KOMORAN은 한국어 형태소 분석기입니다."))
print(komoran.get_plain_text("① 대한민국은 민주공화국이다."))

komoran = Komoran("STABLE")
print(komoran.get_plain_text("KOMORAN은 한국어 형태소 분석기입니다."))
print(komoran.get_plain_text("① 대한민국은 민주공화국이다."))

> python test.py

# EXP ( = Full )
KOMORAN/SL 은/JX 한국어/NNP 형태소/NNP 분석기/NNG 이/VCP ㅂ니다/EF ./SF
①/SW 대한민국/NNP 은/JX 민주공화국/NNP 이/VCP 다/EF ./SF
# STABLE ( = Light )
KOMORAN/SL 은/JX 한국어/NNP 형태소/NNG 분석기/NNG 이/VCP ㅂ니다/EF ./SF
①/SW 대한민국/NNP 은/JX 민주/NNG 공화국/NNG 이/VCP 다/EF ./SF

'형태소분석,Lucene,mecap,komoran,tf-idf vector' 카테고리의 다른 글

KoNLPy, TF-IDF Vectorizor (0)	2022.01.24
Mecab Kor 사용법 (0)	2022.01.24
기본 설치 및 테스트 (0)	2019.04.25

,

Mecab Kor 사용법

2022. 1. 24. 15:28

jake_kim

형태소분석,Lucene,mecap,komoran,tf-idf vector

tar xvfz mecab-ko-dic-2.1.1-20180720.tar.gz
cd mecab-ko-dic-2.1.1-20180720
./configure
make
sudo make install

# 참조 : https://bitbucket.org/eunjeon/mecab-ko-dic/src/master/

# 참조 : https://mondayus.tistory.com/46

> cd download

1) download 3 files

wget https://bitbucket.org/eunjeon/mecab-ko/downloads/mecab-0.996-ko-0.9.2.tar.gz

wget https://bitbucket.org/eunjeon/mecab-ko-dic/downloads/mecab-ko-dic-2.1.1-20180720.tar.gz

git clone https://bitbucket.org/eunjeon/mecab-python-0.996.git

2) mecab-0.996

tar xvfz mecab-0.996-ko-0.9.2.tar.gz
cd mecab-0.996-ko-0.9.2
./configure
make
make check
sudo make install

# sudo ldconfig
export LD_LIBRARY_PATH=/usr/local/lib:$LD_LIBRARY_PATH

3) mecab-ko-dic

tar xvfz mecab-ko-dic-2.1.1-20180720.tar.gz
cd mecab-ko-dic-2.1.1-20180720
./configure
make
sudo make install

# 아래 디렉토리에 설치
/usr/local/lib/mecab/dic/mecab-ko-dic

# Jupyter 에서 사용 가능

from konlpy.tag import Mecab
mecab = Mecab()

4) 테스트

$ mecab -d /usr/local/lib/mecab/dic/mecab-ko-dic
mecab-ko-dic은 MeCab을 사용하여, 한국어 형태소 분석을 하기 위한 프로젝트입니다.
mecab    SL,*,*,*,*,*,*,*
-    SY,*,*,*,*,*,*,*
ko    SL,*,*,*,*,*,*,*
-    SY,*,*,*,*,*,*,*
dic    SL,*,*,*,*,*,*,*
은    JX,*,T,은,*,*,*,*
MeCab    SL,*,*,*,*,*,*,*
을    JKO,*,T,을,*,*,*,*
사용    NNG,행위,T,사용,*,*,*,*
하    XSV,*,F,하,*,*,*,*
여    EC,*,F,여,*,*,*,*
,    SC,*,*,*,*,*,*,*
한국어    NNG,*,F,한국어,Compound,*,*,한국/NNG/*+어/NNG/*
형태소    NNG,*,F,형태소,Compound,*,*,형태/NNG/*+소/NNG/*
분석    NNG,행위,T,분석,*,*,*,*
을    JKO,*,T,을,*,*,*,*
하    VV,*,F,하,*,*,*,*
기    ETN,*,F,기,*,*,*,*
위한    VV+ETM,*,T,위한,Inflect,VV,ETM,위하/VV/*+ᆫ/ETM/*
프로젝트    NNG,*,F,프로젝트,*,*,*,*
입니다    VCP+EF,*,F,입니다,Inflect,VCP,EF,이/VCP/*+ᄇ니다/EF/*
.    SF,*,*,*,*,*,*,*
EOS

5) 기타 확인

품사확인)

https://docs.google.com/spreadsheets/d/1-9blXKjtjeKZqsf4NzHeYJCrr49-nXeRF6D80udfcwY/edit#gid=589544265

6) mecab python

cd mecab-python-0.996
python setup.py build
su

# 이제 Jupyter 에서 정상적으로 Mecab 사용 가능

from konlpy.tag import Mecab
mecab = Mecab()

# 추가

# 사용자 사전 등록 및 우선순위 높이기

참고 : https://bitbucket.org/eunjeon/mecab-ko-dic/src/df15a487444d88565ea18f8250330276497cc9b9/final/user-dic/README.md

참고 : https://kugancity.tistory.com/entry/mecab에-사용자사전기분석-추가하기

- 표층형 : 단어명
- 1785,3543,4267 : 좌문맥ID, 우문맥ID, 단어비용 (자동 생성 처리로 입력)
- 품사태그 : 품사 입력 (mecab-ko-dic 품사 태그를 참조하여 입력)
- 의미분류 : 인명 또는 지명 또는 *
- F : 받침유무 (원 단어의 끝 글자 받침 유무로 T, F 입력)
- 읽기 : 발음 (원 단어의 발음을 입력)
- 타입 : inflected, compound, Preanalysis, *  
- 첫번째 품사, 마지막 품사 : 기분석으로 나눠지는 토큰에 대한 각 품사 입력 (mecab-ko-dic 품사 태그를 참조하여 입력)
- 원형 : 토큰 들로 나눠지는 부분 +로 입력 ( 각 토큰 : 표층형/품사태그/의미분류 ) 
- 인덱스표현  : 토큰 들로 나눠지는 부분 +로 입력 ( 각 토큰:  표층형/품사태그/의미부류/PositionIncrementAttribute/PositionLengthAttribute)

1. 사전 등록

- mecab-ko-dic 설치 위치로 가서 nnp.csv에 사용자 사전 입력

>> cd ~/mecab-ko-dic-2.1.1-20180720/user-dic
>> vi nnp.csv

다음 형태로 입력
구르미그린달빛,0,0,0,NNP,*,T,구르미그린달빛,*,*,*,*
고프로,0,0,0,NNP,*,F,고프로,*,*,*,*
썸즈업,0,0,0,NNP,*,T,썸즈업,*,*,*,*
썬팍,0,0,0,NNP,*,T,썬팍,*,*,*,*

- 적용 전 테스트

- 사전 적용

tools>> ./add-userdic.sh

mecab-ko-dic-2.1.1-20180720 >> make install

(base) ➜  mecab-ko-dic-2.1.1-20180720 cd tools
(base) ➜  tools ./add-userdic.sh
(base) ➜  mecab-ko-dic-2.1.1-20180720 make install
generating userdic...
nnp.csv
.
.
.
emitting matrix      : 100% |###########################################|

done!
echo To enable dictionary, rewrite /usr/local/etc/mecabrc as \"dicdir = /usr/local/lib/mecab/dic/mecab-ko-dic\"
To enable dictionary, rewrite /usr/local/etc/mecabrc as "dicdir = /usr/local/lib/mecab/dic/mecab-ko-dic"
(base) ➜  tools

- 사전 적용 후 테스트

사전은 적용 되었지만 등록한 단어의 우선순위(단어비용)가 낮아서, 원하지 않는 모양으로 분해 되는 것을 확인함. (단어비용을 낮추면 검색 우선순위가 높아짐)
현재 적용된 단어 비용

구르미그린달빛의 단어비용 : 2953
구르/미/그린/달빛 단어비용 : 2679/3464/1247/2215

2. 우선 순위 적용 (단어비용 낮추기)

- mecab-ko-dic-2.1.1-20180720 >> vi user-nnp.csv

- 2953 → 0 변경 후 저장

- 변경 사항 반영
mecab-ko-dic-2.1.1-20180720 >> sudo make clean
mecab-ko-dic-2.1.1-20180720 >> sudo make install

- 결과 확인

'형태소분석,Lucene,mecap,komoran,tf-idf vector' 카테고리의 다른 글

KoNLPy, TF-IDF Vectorizor (0)	2022.01.24
KOMORAN 설치 (0)	2022.01.24
기본 설치 및 테스트 (0)	2019.04.25

,

MAU, MCU, DAU, ACU

2022. 1. 21. 09:48

jake_kim

IT 용어, 인터넷 활용 등등

# https://brunch.co.kr/@userhabit/19

# https://brunch.co.kr/@supims/165

MAU = Monthly Activity User

DAU = Daily Activity User

MCU = Maximum Current User

ACU = Average Current User

전자정부표준프레임워크 (eGovFrame) (0)	2022.03.04
Dell, Linux, Ubuntu, TouchScreen, Off (0)	2022.01.28
Fluent API, 빌더 패턴 (0)	2021.11.15
Template Meta Programming (0)	2021.11.14
JWT ( JSON Web Token ) (0)	2021.08.19

Docker 설치

2022. 1. 18. 15:29

jake_kim

Database 관리/PostgreSQL

1. hub.docker.com> search offiicial immage for postgresql > click > find command like "docker pull postgres"

2. docker pull postgres

3. docker run -d -p 5432:5432 -e POSTGRES_PASSWORD="<YourStrong@Passw0rd>" --name PostgreSQL01 postgres

4.

'Database 관리 > PostgreSQL' 카테고리의 다른 글

속도.postgres vs mariad (0)	2022.06.15
설치/기본사용.우분투 (0)	2022.06.15
필독) 특이 SQL 및 PLAN, Access 방식, 지원 Join, Query Rewrite (0)	2021.12.30

,

Install mariadb.10.6.tar.gz in ubuntu

2022. 1. 18. 13:28

jake_kim

Database 관리/Maria DB

# 참조 : https://dbwriter.io/installation-mariadb10/

1) 다운로드

https://downloads.mariadb.org/

2) 압축해제 및 mysql 사용자 추가

sudo groupadd mysql
sudo useradd -g mysql mysql
cd /usr/local
sudo tar -zxvpf mariadb-10.6.4-linux-systemd-x86_64.tar.gz
sudo ln -s mariadb-10.6.4-linux-systemd-x86_64 mysql

3) vi /etc/my.cnf

[client-server]
port=3306

# This will be passed to all MariaDB clients
[client]

# The MariaDB server
[mysqld]
# Directory where you want to put your data
datadir=/usr/local/mysql/data

character-set-server = utf8mb4
init_connect = SET collation_connection = utf8mb4_general_ci
init_connect = SET NAMES utf8mb4

# This is the prefix name to be used for all log, error and replication files
log-basename=mariadb

4) 초기화

cd /usr/local/mysql
sudo ./scripts/mysql_install_db --user=mysql

5) 디렉토리 권한 변경

/usr/local/mysql> sudo chown -R root .
/usr/local/mysql> sudo chown -R mysql data

6) 서버 기동

sudo ./bin/mysqld_safe --user=mysql &
or
sudo ./bin/mysqld_safe --defaults-file=~/.my.cnf --user=mysql &

7) 오류발생시

# ERROR 2002 (HY000): Can't connect to local server through socket '/var/run/mysqld/mysqld.sock' (2)

sudo mkdir /var/run/mysqld

sudo ln - /tmp/mysql.sock /var/run/mysqld/mysqld.sock

8)

> mysql

Welcome to the MariaDB monitor. Commands end with ; or \g.
Your MariaDB connection id is 3
Server version: 10.6.4-MariaDB MariaDB Server

Copyright (c) 2000, 2018, Oracle, MariaDB Corporation Ab and others.

Type 'help;' or '\h' for help. Type '\c' to clear the current input statement.

MariaDB [(none)]> select user();

+------------------+
| user() |
+------------------+
| user@localhost |
+------------------+

9) sudo ./mysql_secure_installation

Switch to unix_socket authentication [Y/n] n
Change the root password? [Y/n] Y
New password: "ROOT 비번 설정"
Re-enter new password:
Password updated successfully!
Remove anonymous users? [Y/n] Y
Disallow root login remotely? [Y/n] n
Remove test database and access to it? [Y/n] n
Reload privilege tables now? [Y/n] Y
... Success!

Cleaning up...

All done! If you've completed all of the above steps, your MariaDB
installation should now be secure.

Thanks for using MariaDB!

10)

mysql> select host, user from mysql.user;

mysql> create user ml@localhost identified by 'ml';

mysql> flush privileges;

mysql> grant all privileges on *.* to ml@'localhost' identified by 'ml';

mysql> grant all privileges on *.* to ml@'%' identified by 'ml';
mysql> flush privileges;

11)

show variables like '%set%';

SELECT schema_name, default_character_set_name, DEFAULT_COLLATION_NAME FROM information_schema.SCHEMATA ;

'Database 관리 > Maria DB' 카테고리의 다른 글

Data Directory 변경 (0)	2023.05.26
binlog 삭제 (0)	2023.01.16
Lock, 락, 해결 (0)	2021.10.29
RDS, export, mysqldump/ import (0)	2021.10.14
[편법] CVS Parsing, 파싱 (0)	2021.09.23

,

StoryCode

'2022/01'에 해당되는 글 19건

Dell, Linux, Ubuntu, TouchScreen, Off

'IT 용어, 인터넷 활용 등등' 카테고리의 다른 글

비정형 데이터 분석.참고자료

비정형 데이터 분석

Github Action 사용법

'Git, Github' 카테고리의 다른 글

KoNLPy, TF-IDF Vectorizor

'형태소분석,Lucene,mecap,komoran,tf-idf vector' 카테고리의 다른 글

KOMORAN 설치

'형태소분석,Lucene,mecap,komoran,tf-idf vector' 카테고리의 다른 글

Mecab Kor 사용법

# 사용자 사전 등록 및 우선순위 높이기

1. 사전 등록

2. 우선 순위 적용 (단어비용 낮추기)

'형태소분석,Lucene,mecap,komoran,tf-idf vector' 카테고리의 다른 글

MAU, MCU, DAU, ACU

'IT 용어, 인터넷 활용 등등' 카테고리의 다른 글

Docker 설치

'Database 관리 > PostgreSQL' 카테고리의 다른 글

Install mariadb.10.6.tar.gz in ubuntu

'Database 관리 > Maria DB' 카테고리의 다른 글

티스토리툴바

« 2022/01 »
일	월	화	수	목	금	토
						1
2	3	4	5	6	7	8
9	10	11	12	13	14	15
16	17	18	19	20	21	22
23	24	25	26	27	28	29
30	31