Python比較SAP BOM表數(shù)據(jù)

前言背景

年前運(yùn)用Python寫的小應(yīng)用，借此刻寫本文的機(jī)會(huì)吟榴，再次整理總結(jié)。
我們公司電子工程師會(huì)使用Mentor工具導(dǎo)出一張BOM表者吁，然后相關(guān)專員會(huì)將其手動(dòng)輸入到SAP系統(tǒng)里 ( 為何不能一鍵導(dǎo)入囊骤？）摇零。但由于數(shù)據(jù)龐雜虎敦，專員擔(dān)心手動(dòng)輸入可能有誤股淡，于是又將SAP里數(shù)據(jù)導(dǎo)出挺狰，讓工程師去比對(duì)確認(rèn)材料明郭、位號(hào)和數(shù)量有無錯(cuò)誤。
有一次看到某同事一直忙于在整理Excel她渴、比較數(shù)據(jù)达址，于是我就想能否用Pandas和PyQT5去寫個(gè)小工具自動(dòng)比較并導(dǎo)出結(jié)果？
工具界面十分簡(jiǎn)單趁耗，只要選擇SAP數(shù)據(jù)表和Mentor數(shù)據(jù)表后沉唠，點(diǎn)擊保存至Excel即可。有差異的地方按照自己想要的樣式呈現(xiàn)在Excel苛败，如此節(jié)省工程師整理比較數(shù)據(jù)時(shí)間满葛。
SAP表里可能會(huì)有個(gè)某個(gè)大類產(chǎn)品下多個(gè)型號(hào)的設(shè)備BOM數(shù)據(jù)，而Mentor 表是單一型號(hào)設(shè)備數(shù)據(jù)罢屈。SAP 表每一行都是某個(gè)位號(hào)的材料信息嘀韧，而Mentor 表里每行是某個(gè)Part No信息，2張表樣式（部分）如下：

SAP和Mentor數(shù)據(jù)表(部分內(nèi)容)

從上圖中可看出SAP和Mentor數(shù)據(jù)表里都有很多列缠捌，但我們需要比較的分別是SAP 的Material,Componet(同Mentor里的PART NO) 和Installation point(同Mentor里的REF DES)锄贷，和Mentor里的PART NO,REF DES以及COUNT译蒂。
SAP Material列篩選后有4個(gè)不同的數(shù)據(jù)，50149261-003谊却，50149261-013柔昼，50149261-023，50149261-033炎辨，它們是用于區(qū)分同類產(chǎn)品下的4個(gè)不同module的設(shè)備捕透。Mentor表文件名稱包含這些字段，如文件名50149261-033(SCBIP-24V)碴萧。

工具界面

小工具界面

SAP數(shù)據(jù)

我們Upload SAP BOM表時(shí)需要對(duì)Excel進(jìn)行簡(jiǎn)單判斷乙嘀，要求其必須含有Installation point列(Mentor不含此列)。選擇Mentor數(shù)據(jù)表時(shí)破喻，要求Excel文件名稱含關(guān)鍵字段虎谢，否則給予錯(cuò)誤提示。

    def loadSAPData(self):
        file_Path, _ = QFileDialog.getOpenFileName(self, 'Open file', "C:\\", 'Excel files(*.xlsx , *.xls)')
        # 如果用戶不選擇文件低缩，直接取消選擇嘉冒，
        if file_Path =='':
            return
        df = pd.read_excel(file_Path) #根據(jù)路徑讀取excel文件
        columns = df.columns.values.tolist() # 獲取表所有的列名
        if 'Installation point' in columns: # 判斷關(guān)鍵字在列名稱列表里
            self.label_sap_file.setText(file_Path.split('/')[-1]) #文件名顯示在界面上
            self.sap_df  = df 
            self.materials = df['Material'].unique() #列表，存放不同型號(hào)
            self.label_sap.setStyleSheet("QLabel{border-image: url(:icons/excel.png);}") #label設(shè)置圖片
        else:
            self.showMessage()  #選擇的文件不含有Installation point列咆繁，則提示錯(cuò)誤信息

    def showMessage(self):
        self.materials = [] #清空列表里的material
        messageBox = QMessageBox()
        messageBox.setWindowIcon(QIcon(':icons/error.png')) #注意為了打包后APP上能夠顯示圖片讳推，此處使用冒號(hào)，icons 是項(xiàng)目里的一個(gè)自己創(chuàng)建的文件夾
        messageBox.setWindowTitle('Error Message')
        messageBox.setText(
            "This file doesn't include a column named 'Installation point'.\nWould you like reload a file? ")
        messageBox.setStandardButtons(QMessageBox.Yes | QMessageBox.No)
        buttonY = messageBox.button(QMessageBox.Yes)
        buttonY.setText('Select File')
        buttonN = messageBox.button(QMessageBox.No)
        buttonN.setText('Cancel')
        messageBox.exec_()
        if messageBox.clickedButton() == buttonY:
            self.loadSAPData()

選擇文件錯(cuò)誤提示

根據(jù)material清洗整理SAP數(shù)據(jù)

前文中說到SAP數(shù)據(jù)表里包含多個(gè)module的數(shù)據(jù)玩般，而Mentor數(shù)據(jù)表是單個(gè)型號(hào)的數(shù)據(jù)银觅，其文件名稱里包含material字段。

首先需要根據(jù)material來篩選出不同的數(shù)據(jù)：

def getSperatedDF(self,  material):
        # 根據(jù)材料名篩選坏为，如'50149261-003'
        df = self.sap_df.loc[self.sap_df['Material'] == material]
        # 無重復(fù)的location 數(shù)量
        count_df = df.groupby('Component', sort=False)[['Installation point']].nunique().reset_index()
        # 同一顆的不同位置拼接在一起并空格分隔
        gpby_df = df.groupby('Component', sort=False).apply(lambda x: ' '.join(x['Installation point'])).reset_index()
        # 取出'Component', 'Installation point' 2 列數(shù)據(jù)
        gpby_df.columns = ['Component', 'Installation point']

        gpby_df['SAP_COUNT'] = count_df['Installation point']
        # 對(duì)位號(hào)字符串排序
        sorted_df = self.tool.getSortedLocationsDF(gpby_df,'Installation point')
        return sorted_df

接下來需要對(duì)Installation point列各單元格里雜亂的位號(hào)字符串進(jìn)行排序究驴，如將C1 C20 C4 C3排成C1 C3 C4 C20。

    def sortLocationstr(self, installation_point_str):
        if installation_point_str !='': #判斷不為空
            split_list = re.findall(r'[0-9]+|[a-zA-Z]+',installation_point_str)
            first_letters = split_list[0] #獲取第一個(gè)字母匀伏，是C/R/…
            numStr_list = list(set(split_list)) # 對(duì)所有去重洒忧，用set集合方法
            numStr_list.remove(first_letters)# 去掉字母，只留下數(shù)字
            num_list= []
            for num_str in numStr_list:
                num_list.append(int(num_str))
            num_list = sorted(num_list) #數(shù)字進(jìn)行排序
            locations = [] #位置
            for num in num_list:
                location = first_letters + str(num)
                locations.append(location)
            return ' '.join(locations)
        else:
            return  ''

根據(jù)列名稱和df文件生成排序后的文件：

   def getSortedLocationsDF(self, df, column_name):
        for i in range(df.shape[0]): # shape[0]獲取行數(shù)
            installation_point_str = df.loc[i, column_name]
            df.loc[i, column_name] = self.sortLocationstr(installation_point_str) # 對(duì)位置字符串排序
        return df

Mentor數(shù)據(jù)

    def loadMentorData(self):
        file_Path, _ = QFileDialog.getOpenFileName(self, 'Open file', "C:\\", 'Excel files(*.xlsx , *.xls)')
        # 如果用戶不選擇文件够颠，直接取消選擇熙侍，
        if file_Path == '':
            return
        df = pd.read_excel(file_Path)
        material = file_Path.split('/')[-1].split('.')[0]
        if '(' in material:
            material = material.split('(')[0]
        if material in self.materials:
            self.label_mentor_file.setText( file_Path.split('/')[-1])
            self.label_mentor.setStyleSheet("QLabel{border-image: url(:icons/excel.png);}")
            self.mentor_file_name = material
            self.mentor_df = df
            self.mentor_df = self.mentor_df[['PART NO','REF DES','COUNT']].fillna('') # 選出 'PART NO','REF DES','COUNT' 3列數(shù)據(jù)
            self.tool.getSortedLocationsDF(self.mentor_df,'REF DES') # 對(duì)'REF DES' 列單元格里的位置排序
            self.material_df = self.getSperatedDF(material)
        else:
            #以下代碼主要是彈出錯(cuò)誤提示
            self.mentor_file_name = ''
            messageBox = QMessageBox()
            messageBox.setWindowTitle('Error Message')
            messageBox.setWindowIcon(QIcon(':icons/error.png'))
            messageBox.setText(
                "This file isn't part of the SAP data. \nWould you like reload a file? ")
            messageBox.setStandardButtons(QMessageBox.Yes | QMessageBox.No)
            buttonY = messageBox.button(QMessageBox.Yes)
            buttonY.setText('Select File')
            buttonN = messageBox.button(QMessageBox.No)
            buttonN.setText('Cancel')
            messageBox.exec_()
            if messageBox.clickedButton() == buttonY:
                self.loadMentorData()

合并數(shù)據(jù)并比較

至此我們需要的material_df 和 mentor_df已經(jīng)處理完畢，接下來需要將2者merge并進(jìn)行比較履磨。

    def mergeData(self):
        compare_df = pd.merge(self.material_df,self.mentor_df,left_on='Component',right_on='PART NO',how= 'outer')
        compare_df = compare_df.fillna('')
        compare_df['Component_Bool']  = compare_df.apply(lambda x: self.tool.compare(x['Component'],x['PART NO']),axis = 1)
        compare_df['COUNT_Bool']      = compare_df.apply(lambda x: self.tool.compare(x['SAP_COUNT'],x['COUNT']),axis = 1 )
        compare_df['Location_Bool']   = compare_df.apply(lambda x: self.tool.compare(x['Installation point'],x['REF DES']),axis = 1 )
        compare_df['Location_Change'] = compare_df.apply(lambda x: self.tool.compareLocation(str(x['Installation point']), str(x['REF DES'])), axis = 1)
        return  compare_df

這里主要比較對(duì)應(yīng)的兩列內(nèi)容是否一致蛉抓，一致就返回True，否則為False剃诅。

    def compare(self, a, b):
        if a == b:
            return 'True'
        else:
            return 'False'

比較 2個(gè)位號(hào)字符串差異并返回差異結(jié)果巷送，如C2,C4,C7 和C2,C3,C7, 它們的差異就是C3和C4。

    def compareLocation(self, sap_loction, mentor_locatoin):
        if set(sap_loction.split(' ')).difference(mentor_locatoin.split(' ')) != {''}:
            location_list = list(set(sap_loction.split(' ')).difference(set(mentor_locatoin.split(' '))))
        else:
            location_list = list(set(mentor_locatoin.split(' ')).difference(set(sap_loction.split(' '))))
        return ' '.join(location_list)

Excel格式化

創(chuàng)建一個(gè)Excel 類并設(shè)置其相關(guān)格式矛辕，根據(jù)你自己的需求進(jìn)行格式修改笑跛。

import numpy as np
from openpyxl import load_workbook
from openpyxl.styles import Alignment, Side, Border, PatternFill, Font
from openpyxl.styles.colors import BLACK, RED, YELLOW


class Excel(object):

    def __init__(self,excel_path):
        super().__init__()
        self.wb = load_workbook(excel_path)
        self.ws = self.wb.active
        self.excel_path = excel_path
        self.font_name = 'Biome Light'

    def formatExcel(self):
        font        = Font(name = self.font_name , size=10, color = BLACK) #字體
        false_font  = Font(name = self.font_name, bold=True, size=10, color = BLACK) #false單元格字體
        alignment = Alignment(horizontal='center', #水平居中
                              vertical='center', #垂直居中
                              wrap_text=True,  # 文字換行
                              shrink_to_fit=False,  # 自適應(yīng)寬度付魔，改變文字大小,上一項(xiàng)false
                              )
        thin = Side(border_style="thin", color=BLACK) #邊框線顏色
        border = Border(top=thin, left=thin, right=thin, bottom=thin) #邊框線
        header_fill = PatternFill(fill_type='solid', fgColor= YELLOW ) #表頭填充顏色
        false_fill = PatternFill(fill_type='solid', fgColor= RED) #false單元格填充顏色
        header_font = Font(name=self.font_name, bold=True, size=12, color=BLACK)  #表頭單元格字體

        for row in self.ws.rows:
            for cell in row:
                cell.font = font
                cell.alignment = alignment
                cell.border = border
                if cell.value == 'False':
                    cell.fill = false_fill
                    cell.font = false_font
        # 如果是第一行，列名稱那行
        for cell in list(self.ws.rows)[0]:
            cell.fill = header_fill
            cell.font = header_font
        self.setColumnsWidth(25)
        self.wb.save(self.excel_path)

    def setColumnsWidth(self,width):
        column = self.ws.max_column  # 獲取表格列數(shù)
        numbers = np.arange(65, 65 + column)  # 大寫字母A 是65
        for i in [chr(i) for i in numbers]:  #生產(chǎn)字母A,B,C,D……
            self.ws.column_dimensions[i].width = width

導(dǎo)出Excel文件

 def saveToExcel(self):
        compare_df = self.mergeData()
        sorted_df = self.tool.getSortedLocationsDF(compare_df,'Location_Change')
        sorted_df.to_excel('{}_compare.xlsx'.format(self.mentor_file_name),sheet_name= 'compare', engine = 'openpyxl',index=False)
        excel = Excel('{}_compare.xlsx'.format(self.mentor_file_name))
        excel.formatExcel() #設(shè)置格式

對(duì)比結(jié)果（部分）

打包調(diào)用圖片

我們需要?jiǎng)?chuàng)建一個(gè)名稱為makeqrc.py的文件堡牡，運(yùn)行此文件會(huì)生成2個(gè)文件抒抬，分別是images.qrc和images.py文件。
十分重要的一點(diǎn)是晤柄，必須在導(dǎo)包的文件里import images,images 是images.py。

import subprocess, os

images = os.listdir('icons') #icons是存放圖片文件的文件夾

f = open('images.qrc', 'w+')
f.write(u'<!DOCTYPE RCC>\n<RCC version="1.0">\n<qresource>\n')
for item in images:
    f.write(u'<file alias="icons/'+ item +'">icons/'+ item +'</file>\n')
# for item in qss:
#     f.write(u'<file alias="qss/'+ item +'">qss/'+ item +'</file>\n')

f.write(u'</qresource>\n</RCC>')
f.close()
pipe = subprocess.Popen(r'pyrcc5 -o images.py images.qrc', stdout = subprocess.PIPE, stdin = subprocess.PIPE, stderr = subprocess.PIPE, creationflags=0x08)

導(dǎo)包

在Terminal 里輸入下面代碼：

pyinstaller -F -w - i icon.ico **.py

其中icon.ico為exe文件添加圖標(biāo)的文件名妖胀， -w是無黑窗口芥颈，即報(bào)錯(cuò)時(shí)看不到打印信息。

最后編輯于：2024.06.13 16:27:54

?著作權(quán)歸作者所有,轉(zhuǎn)載或內(nèi)容合作請(qǐng)聯(lián)系作者

人面猴
序言：七十年代末赚抡，一起剝皮案震驚了整個(gè)濱河市爬坑，隨后出現(xiàn)的幾起案子，更是在濱河造成了極大的恐慌涂臣，老刑警劉巖盾计，帶你破解...
沈念sama閱讀 217,542評(píng)論 6贊 504
死咒
序言：濱河連續(xù)發(fā)生了三起死亡事件，死亡現(xiàn)場(chǎng)離奇詭異赁遗，居然都是意外死亡署辉，警方通過查閱死者的電腦和手機(jī)，發(fā)現(xiàn)死者居然都...
沈念sama閱讀 92,822評(píng)論 3贊 394
救了他兩次的神仙讓他今天三更去死
文/潘曉璐我一進(jìn)店門岩四，熙熙樓的掌柜王于貴愁眉苦臉地迎上來哭尝，“玉大人，你說我怎么就攤上這事剖煌〔酿校” “怎么了？”我有些...
開封第一講書人閱讀 163,912評(píng)論 0贊 354
道士緝兇錄：失蹤的賣姜人
文/不壞的土叔我叫張陵耕姊，是天一觀的道長(zhǎng)桶唐。經(jīng)常有香客問我，道長(zhǎng)茉兰，這世上最難降的妖魔是什么尤泽？我笑而不...
開封第一講書人閱讀 58,449評(píng)論 1贊 293
?港島之戀（遺憾婚禮）
正文為了忘掉前任，我火速辦了婚禮邦邦，結(jié)果婚禮上安吁，老公的妹妹穿的比我還像新娘。我一直安慰自己燃辖，他們只是感情好鬼店，可當(dāng)我...
茶點(diǎn)故事閱讀 67,500評(píng)論 6贊 392
惡毒庶女頂嫁案：這布局不是一般人想出來的
文/花漫我一把揭開白布。她就那樣靜靜地躺著黔龟，像睡著了一般妇智。火紅的嫁衣襯著肌膚如雪滥玷。梳的紋絲不亂的頭發(fā)上，一...
開封第一講書人閱讀 51,370評(píng)論 1贊 302
城市分裂傳說
那天巍棱，我揣著相機(jī)與錄音惑畴，去河邊找鬼。笑死航徙，一個(gè)胖子當(dāng)著我的面吹牛如贷，可吹牛的內(nèi)容都是我干的。我是一名探鬼主播到踏，決...
沈念sama閱讀 40,193評(píng)論 3贊 418
雙鴛鴦連環(huán)套：你想象不到人心有多黑
文/蒼蘭香墨我猛地睜開眼杠袱，長(zhǎng)吁一口氣：“原來是場(chǎng)噩夢(mèng)啊……” “哼！你這毒婦竟也來了窝稿？” 一聲冷哼從身側(cè)響起楣富，我...
開封第一講書人閱讀 39,074評(píng)論 0贊 276
萬榮殺人案實(shí)錄
序言：老撾萬榮一對(duì)情侶失蹤，失蹤者是張志新（化名）和其女友劉穎伴榔，沒想到半個(gè)月后纹蝴，有當(dāng)?shù)厝嗽跇淞掷锇l(fā)現(xiàn)了一具尸體，經(jīng)...
沈念sama閱讀 45,505評(píng)論 1贊 314
?護(hù)林員之死
正文獨(dú)居荒郊野嶺守林人離奇死亡踪少，尸身上長(zhǎng)有42處帶血的膿包…… 初始之章·張勛以下內(nèi)容為張勛視角年9月15日...
茶點(diǎn)故事閱讀 37,722評(píng)論 3贊 335
?白月光啟示錄
正文我和宋清朗相戀三年塘安，在試婚紗的時(shí)候發(fā)現(xiàn)自己被綠了。大學(xué)時(shí)的朋友給我發(fā)了我未婚夫和他白月光在一起吃飯的照片秉馏。...
茶點(diǎn)故事閱讀 39,841評(píng)論 1贊 348
活死人
序言：一個(gè)原本活蹦亂跳的男人離奇死亡耙旦，死狀恐怖，靈堂內(nèi)的尸體忽然破棺而出萝究，到底是詐尸還是另有隱情免都，我是刑警寧澤，帶...
沈念sama閱讀 35,569評(píng)論 5贊 345
?日本核電站爆炸內(nèi)幕
正文年R本政府宣布帆竹，位于F島的核電站绕娘，受9級(jí)特大地震影響，放射性物質(zhì)發(fā)生泄漏栽连。R本人自食惡果不足惜险领，卻給世界環(huán)境...
茶點(diǎn)故事閱讀 41,168評(píng)論 3贊 328
男人毒藥：我在死后第九天來索命
文/蒙蒙一、第九天我趴在偏房一處隱蔽的房頂上張望秒紧。院中可真熱鬧绢陌，春花似錦、人聲如沸熔恢。這莊子的主人今日做“春日...
開封第一講書人閱讀 31,783評(píng)論 0贊 22
一樁弒父案，背后竟有這般陰謀
文/蒼蘭香墨我抬頭看了看天上的太陽叙淌。三九已至秤掌，卻和暖如春愁铺，著一層夾襖步出監(jiān)牢的瞬間，已是汗流浹背闻鉴。一陣腳步聲響...
開封第一講書人閱讀 32,918評(píng)論 1贊 269
情欲美人皮
我被黑心中介騙來泰國(guó)打工茵乱，沒想到剛下飛機(jī)就差點(diǎn)兒被人妖公主榨干…… 1. 我叫王不留，地道東北人孟岛。一個(gè)月前我還...
沈念sama閱讀 47,962評(píng)論 2贊 370
代替公主和親
正文我出身青樓瓶竭，卻偏偏與公主長(zhǎng)得像，于是被迫代替她去往敵國(guó)和親蚀苛。傳聞我的和親對(duì)象是個(gè)殘疾皇子在验，可洞房花燭夜當(dāng)晚...
茶點(diǎn)故事閱讀 44,781評(píng)論 2贊 354